Chapter 1



Introduction 



1.1 Why write these Notes?

The occasion, or I should better say the pretext, for writing these Notes is a 
graduate course delivered at SISSA (International School for Advanced Studies, 
Trieste) in 2012 and covering a part — and a part only — of the topics dealt with 
here. 

The need for support material for this course is hardly compelling: many very
good review papers exist, all of them state-of-the-art at the time of their
publication, and together they cover nearly all of the present topics. Some
of these reviews are authored or coauthored by me [1, 2, 3, 4, 5, 6, 7, 8, 9], and
many more by outstanding colleagues. Of the latter, I quote only the most recent
ones; a non-exhaustive list is [10, 11, 12, 13, 14, 15, 16, 17, 18].

Writing a set of Lecture Notes in due form is a heavy burden: for a significant
amount of time it diverts the author from his day-to-day research work, which
is already eroded by administration and teaching. I used the word "pretext"
above because these Notes are mostly written for my own sake. In fact I regard
such a burden as an important occasion for a pause of reflection, devoted to
critically rethinking the subject as a whole. In the present case, the pause
of reflection was made possible by an extended stay (four months) at CECAM
in Lausanne.

I have written Lecture Notes on four other occasions, the last time in 2000 [19].
I know by experience that the task of writing in uniform notation and in a
logical sequence many results scattered in the literature leads me to scrutinize
from a novel viewpoint not only other people's work, but even my own. Here
LaTeX plays a major role; soon after its appearance in the early 1980s my way
of thinking changed dramatically. While previously I (like everybody)
reasoned by jotting formulas on a sheet of paper, nowadays I cannot reason
clearly unless the formulas are neatly typeset in LaTeX.

The very fact of writing many different known results all together has the
beneficial effect of making quite evident several links, hitherto unsuspected.
The perspective gained while writing the present Notes (and typesetting
them in LaTeX) will doubtless influence the future course of my research activity,
as in fact already happened on the previous occasions.

Figure 1.1: The hallmark of topology, as in many popular presentations. Hundreds
of figures like this, and even some very perspicuous videos, can be downloaded
from the internet. The two closed surfaces ("two-dimensional compact manifolds")
have the same topological invariant g = 1, which measures the number of handles.

1.2 What topology is about 

Topology is defined as a branch of mathematics that describes properties which 
remain unchanged under smooth deformations; such properties are usually la- 
belled by integer numbers, named topological invariants. The concepts and 
tools belonging to topology are continuity and connectivity, open and closed 
sets, neighborhoods, and the like. 

Differentiability, or even a metric structure, are not needed; theorems are 
proved under very general hypotheses, and are therefore very powerful, being 
applicable to very diverse frameworks. The tradeoff is that proofs, and even def- 
initions, look clumsy and obscure to readers with the mathematical background 
of a typical condensed matter physicist. The good news is that the topological 
properties most relevant for electronic structure theory can be formulated in the 
more familiar language of differential geometry. 

Many introductions to topology start with the statement that, to a topolo- 
gist, a coffee cup and a doughnut are the same thing, as in Fig. 1.1. Intuitively, 
the common feature of the two objects is the presence of one, and only one, 
handle. The mathematical definition of "handle" is coming soon. 

1.2.1 Gauss-Bonnet theorem 

We start with the simplest example, a sphere, and a tangent plane at a given 
point. In a local system of Cartesian coordinates on the plane the equation of 
the sphere is 

$$ z = R - \sqrt{R^2 - x^2 - y^2} , \tag{1.1} $$

and the Hessian matrix is

$$ H = \begin{pmatrix} \partial^2 z / \partial x^2 & \partial^2 z / \partial x \partial y \\ \partial^2 z / \partial y \partial x & \partial^2 z / \partial y^2 \end{pmatrix} = \begin{pmatrix} 1/R & 0 \\ 0 & 1/R \end{pmatrix} . \tag{1.2} $$

The Gaussian curvature $\Omega$ is by definition the determinant of the Hessian at the
tangency point. It is obviously constant and equal to $1/R^2$ at any point of the
sphere; notice that the orientation of the z axis (either inwards or outwards) is
irrelevant. The integral of $\Omega$ over the whole closed surface is $4\pi$.

Next we consider a smooth (i.e. twice differentiable) closed surface of arbitrary
shape: the Gaussian curvature is defined as the determinant of the Hessian
at the tangent plane, similarly to what we did for the sphere:

$$ \Omega = \det \begin{pmatrix} \partial^2 z / \partial x^2 & \partial^2 z / \partial x \partial y \\ \partial^2 z / \partial y \partial x & \partial^2 z / \partial y^2 \end{pmatrix} . \tag{1.3} $$

Figure 1.2: A sphere of radius R, and its tangent plane at a generic point. The
Gaussian curvature in this trivial case is $\Omega = 1/R^2$.



In general, $\Omega$ can be positive, negative (at a saddle point), or zero (e.g. for
a plane or a cylinder). The Gauss-Bonnet theorem states that for any closed
smooth surface

$$ \frac{1}{2\pi} \int_S d\sigma \; \Omega = 2 (1 - g) , \tag{1.4} $$

where g is a nonnegative integer, called the "genus" of the surface. Surfaces
which can be continuously deformed into each other (i.e. "homeomorphic")
have the same genus. For the sphere and any surface homeomorphic to it g = 0;
both the coffee cup and the doughnut of Fig. 1.1 have g = 1; a double-handle
cup has g = 2. The genus is thus the mathematical definition of the number
of handles.
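The theorem can be checked numerically. The following is a minimal sketch, not part of the original derivation: it assumes a parametrized surface r(u, v), evaluates the Gaussian curvature times the area element from the first and second fundamental forms by finite differences, and integrates with the midpoint rule. The function names, the two sample surfaces, and the grid sizes are illustrative choices.

```python
import math

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def curvature_density(surf, u, v, h=1e-3):
    """Gaussian curvature times area element, per unit du dv (finite differences)."""
    r0 = surf(u, v)
    rup, rum = surf(u + h, v), surf(u - h, v)
    rvp, rvm = surf(u, v + h), surf(u, v - h)
    ru = tuple((p - m) / (2 * h) for p, m in zip(rup, rum))
    rv = tuple((p - m) / (2 * h) for p, m in zip(rvp, rvm))
    ruu = tuple((p - 2 * c + m) / h**2 for p, c, m in zip(rup, r0, rum))
    rvv = tuple((p - 2 * c + m) / h**2 for p, c, m in zip(rvp, r0, rvm))
    ruv = tuple((a - b - c + d) / (4 * h * h) for a, b, c, d in zip(
        surf(u + h, v + h), surf(u + h, v - h),
        surf(u - h, v + h), surf(u - h, v - h)))
    normal = _cross(ru, rv)
    area = math.sqrt(_dot(normal, normal))      # sqrt(EG - F^2)
    unit = tuple(x / area for x in normal)
    L, M, N = _dot(ruu, unit), _dot(ruv, unit), _dot(rvv, unit)
    return (L * N - M * M) / area               # Omega * dsigma / (du dv)

def genus(surf, u_range, v_range, nu=80, nv=80):
    """Genus from the Gauss-Bonnet theorem: g = 1 - (1/4pi) * integral of Omega."""
    du = (u_range[1] - u_range[0]) / nu
    dv = (v_range[1] - v_range[0]) / nv
    total = 0.0
    for i in range(nu):
        for j in range(nv):
            u = u_range[0] + (i + 0.5) * du     # midpoint rule
            v = v_range[0] + (j + 0.5) * dv
            total += curvature_density(surf, u, v) * du * dv
    return 1.0 - total / (4.0 * math.pi)

def ellipsoid(theta, phi, a=1.0, b=1.5, c=0.7):
    """Sample surface homeomorphic to a sphere (semi-axes are arbitrary)."""
    return (a * math.sin(theta) * math.cos(phi),
            b * math.sin(theta) * math.sin(phi),
            c * math.cos(theta))

def torus(u, v, big=2.0, small=0.5):
    """Sample doughnut (radii are arbitrary)."""
    return ((big + small * math.cos(v)) * math.cos(u),
            (big + small * math.cos(v)) * math.sin(u),
            small * math.sin(v))

g_ellipsoid = genus(ellipsoid, (0.0, math.pi), (0.0, 2.0 * math.pi))
g_torus = genus(torus, (0.0, 2.0 * math.pi), (0.0, 2.0 * math.pi))
print(f"ellipsoid: g = {g_ellipsoid:.4f}   torus: g = {g_torus:.4f}")
```

Although the local curvature of the ellipsoid varies from point to point, its integral is fixed by topology alone; on the torus the positive curvature of the outer rim and the negative curvature of the inner rim cancel.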

1.2.2 Euler characteristic 

We have considered smooth surfaces so far, but topological invariants are
based on the more general condition of continuity and (to the delight of
mathematicians) can unsurprisingly be defined even for pathological surfaces
("manifolds" in topology-speak). The simplest nonsmooth case is that of
polyhedra, where the Gaussian curvature is either zero (on the faces) or
singular (at vertices and edges).

The Euler characteristic is defined as $\chi = V - E + F$, where V is the number
of vertices, E is the number of edges, and F is the number of faces. For the
regular polyhedra (tetrahedron, cube, octahedron, dodecahedron, icosahedron)
it is easily verified that $\chi = 2$. All these surfaces can be
continuously deformed into ("are homeomorphic to") each other, and into a
sphere. In fact there is a one-to-one relationship between the Euler characteristic
and the genus: $\chi = 2(1 - g)$. Polyhedra can also have $\chi \neq 2$, like the
doughnut-shaped one shown in Fig. 1.3.

Figure 1.3: A doughnut-shaped polyhedron. This surface has Euler characteristic
$\chi = 0$ or, equivalently, genus g = 1.
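The counting of vertices, edges, and faces is easily automated. The sketch below assumes that a polyhedron is given as a list of faces, each face being a loop of vertex labels; the doughnut-shaped example is a hypothetical 3 x 3 grid of quadrilaterals closed periodically in both directions, not necessarily the polyhedron of Fig. 1.3.

```python
def euler_characteristic(faces):
    """chi = V - E + F for a closed polyhedral surface given as vertex loops."""
    vertices = set()
    edges = set()
    for face in faces:
        for k, a in enumerate(face):
            b = face[(k + 1) % len(face)]
            vertices.add(a)
            edges.add(frozenset((a, b)))   # undirected edge
    return len(vertices) - len(edges) + len(faces)

def genus_from_chi(chi):
    """Invert chi = 2(1 - g)."""
    return 1 - chi // 2

tetrahedron = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]

# Cube vertices are labeled by x + 2*y + 4*z with x, y, z in {0, 1}.
cube = [(0, 1, 3, 2), (4, 5, 7, 6), (0, 1, 5, 4),
        (2, 3, 7, 6), (0, 2, 6, 4), (1, 3, 7, 5)]

def torus_grid(n=3, m=3):
    """Hypothetical doughnut-shaped polyhedron: n x m quadrilaterals,
    periodic in both directions (n, m >= 3)."""
    def idx(i, j):
        return (i % n) * m + (j % m)
    return [(idx(i, j), idx(i + 1, j), idx(i + 1, j + 1), idx(i, j + 1))
            for i in range(n) for j in range(m)]

chi_tetra = euler_characteristic(tetrahedron)
chi_cube = euler_characteristic(cube)
chi_torus = euler_characteristic(torus_grid())
print(chi_tetra, chi_cube, chi_torus, genus_from_chi(chi_torus))
```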

1.3 Electronic wavefunctions 

In the domain of electronic structure, the typical object addressed via geomet- 
rical and/or topological concepts is the electronic ground state of some system. 
Whenever an observable effect has the nature of a topological invariant, i.e. 
it is an integer number, two remarkable features occur. (1) The observable 
is measurable in principle with infinite precision ($10^{-9}$ is actually attained for
the quantum Hall effect). (2) The observable is very robust under even strong 
variations of the sample conditions; a very disruptive perturbation is needed to 
switch from one integer to another. Topology concerns mostly insulators: in 
this case the disruptive perturbation amounts to crossing a metallic state. 

These Notes are entirely devoted to physical properties having a topological 
and/or geometrical character. I am not sure of always using the right seman- 
tics. Loosely speaking, I would use the term "topological" for something which 
is quantized, and "geometrical" for something which is not. The framework 
and the mathematical tools are often the same for quantized and nonquantized 
quantities, the former frequently occurring as special cases of the latter. 

The Berry phase is the typical geometrical quantity which is not quantized, 
although it can be quantized in high-symmetry cases. The macroscopic polar- 
ization of a solid is a Berry phase, and is obviously (from an experimental view- 
point) a nonquantized observable. Nonetheless, there are aspects of the modern 
theory of polarization that I would define topological. The same applies to other 
geometrical properties considered in these Notes; it is reassuring that even other
authors often use "geometrical" and "topological" as synonyms.

Finally, a few words about the many calculations cited and sometimes briefly
outlined here. Unless otherwise stated, the term "first-principles calculations",
when referred to a condensed matter system, means density functional calculations;
independent-electron eigenfunctions and eigenvalues are the Kohn-Sham
(KS) ones. Although these Notes mostly address a computational physics readership,
no technical details are given (basis sets, pseudopotentials, functionals...);
they are obviously detailed in the original literature, while the focus here is on
the physical properties.



1.4 Units

We use Gaussian electromagnetic units throughout: these have the advantage
(at variance with SI units) that electric and magnetic fields have the same
dimensions. Furthermore, the nasty $\varepsilon_0$ and $\mu_0$ disappear; SI formulas are
converted by setting $4\pi\varepsilon_0 = 1$ and $4\pi/\mu_0 = 1$.

For a single particle of mass m and charge Q, the Newton equation of motion
and the Hamiltonian read, respectively

$$ m \frac{d\mathbf{v}}{dt} = Q \left( \mathbf{E} + \frac{1}{c} \mathbf{v} \times \mathbf{B} \right) , \tag{1.5} $$

$$ H = \frac{1}{2m} \left( \mathbf{p} - \frac{Q}{c} \mathbf{A}(\mathbf{r}) \right)^2 + Q \Phi(\mathbf{r}) . \tag{1.6} $$

Generally, Gaussian electromagnetic units are associated with mechanical cgs
units, but this is by no means necessary. In electronic structure theory, it is
expedient to associate Gaussian electromagnetic units with atomic units (a.u.),
defined by $e^2 = 1$, $m_e = 1$, $\hbar = 1$. The unit of energy is the hartree (1 Ha = 2
Ry = 27.21 eV). In the present Notes the electron charge is $-e$, with e > 0; this
sign choice agrees with most (but not all) of the recent literature. For instance,
the very popular review of Ref. [1] adopts e < 0.

The speed of light in a.u. is $c \simeq 137$. This immediately hints at why the
largest atomic number Z in the periodic table is $Z \sim 100$: in fact the core
electrons have (in a.u.) energies of the order of $Z^2$, hence velocities of the order
of Z.
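The order-of-magnitude argument above can be put into numbers. This is a rough sketch under two stated assumptions: the speed of light in a.u. equals the inverse fine-structure constant (the CODATA 2018 value is hard-coded), and the velocity of a core electron is estimated as Z a.u., as for a hydrogen-like 1s orbital.

```python
# Fine-structure constant (CODATA 2018 recommended value, dimensionless).
ALPHA = 7.2973525693e-3

# In atomic units (e^2 = m_e = hbar = 1) the speed of light is 1/alpha.
c_au = 1.0 / ALPHA

def core_velocity_ratio(Z):
    """Estimate of v/c for the innermost electrons: v is about Z a.u."""
    return Z / c_au

for Z in (1, 29, 79, 100, 137):
    print(f"Z = {Z:3d}   v/c ~ {core_velocity_ratio(Z):.3f}")
```

The nonrelativistic estimate reaches v = c at Z of about 137, which is why no atom can exist much beyond Z of the order of 100.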



1.5 Symbols 

I am faced here with two contrasting requirements: adopting the symbols most
commonly used in the literature, and adopting different symbols for different
objects. This proved to be nearly impossible in a work of the present kind, if
baroque symbols are to be ruled out. For instance, throughout the present Notes
I use the letter A in several different typefaces, each with a different meaning,
and similarly for the letter P. Despite this, I found it unavoidable to use
(in different Chapters) the same symbol for different
objects. For instance, depending on the context, the symbol P may indicate
a projector or, otherwise, a one-dimensional electrical polarization. Another
example is the symbol $\Omega$ used for the Berry curvature, while $\Omega_I$ is the
gauge-invariant quadratic spread of the Wannier functions. Therefore caution is in
order when extrapolating a given symbol from its own context.



1.6 Gauge and flux 



We consider here a simple exercise which plays the role of a very important 
paradigm; it illustrates basic concepts and results which are going to reappear 
several times all along the present Notes. 

We address the single-particle Hamiltonian 

$$ H = \frac{1}{2m} \left( \mathbf{p} + \frac{e}{c} \mathbf{A} \right)^2 + V(\mathbf{r}) , \tag{1.7} $$

where the vector potential A is independent of space and time. It is usually 
said that A is a pure gauge, meaning with this that it does not affect the fields: 

$$ \mathbf{B} = \nabla \times \mathbf{A} = 0 , \qquad \mathbf{E} = -\frac{1}{c} \frac{\partial \mathbf{A}}{\partial t} = 0 . \tag{1.8} $$

1.6.1 Classical mechanics 

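Let us first adopt a classical viewpoint. Hamilton's equations of motion are

$$ \dot{\mathbf{p}} = -\frac{\partial H}{\partial \mathbf{r}} = -\nabla V(\mathbf{r}) , \tag{1.9} $$

$$ \dot{\mathbf{r}} = \frac{\partial H}{\partial \mathbf{p}} = \frac{1}{m} \left( \mathbf{p} + \frac{e}{c} \mathbf{A} \right) . \tag{1.10} $$

From these we get

$$ \mathbf{p} = m \dot{\mathbf{r}} - \frac{e}{c} \mathbf{A} , \tag{1.11} $$

which leads to the Newton equation of motion

$$ m \ddot{\mathbf{r}} = -\nabla V(\mathbf{r}) . \tag{1.12} $$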

The bottom line looks quite obvious: a pure gauge has no effect. A basic tenet 
of classical mechanics is that the equations of motion can always be directly 
expressed in terms of the forces (i.e. the fields), while the potentials — scalar 
and vector — are auxiliary quantities, devoid of physical meaning. 
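The classical statement can be verified by integrating Hamilton's equations directly. The following minimal sketch works in 1d and assumes a hypothetical quartic test potential V(x) = x^4/4 and a leapfrog integrator; neither is taken from the text. Two runs start from the same position and velocity, so that their canonical momenta differ by (e/c)A, as in Eq. (1.11).

```python
def trajectory(A, x0=1.0, v0=0.3, m=1.0, e=1.0, c=137.036, dt=1e-3, steps=5000):
    """Integrate Hamilton's equations in 1d with a constant vector potential A.

    Hypothetical test potential: V(x) = x**4 / 4, hence the force is -x**3.
    The canonical momentum starts at p = m*v0 - (e/c)*A.
    """
    shift = e * A / c
    x, p = x0, m * v0 - shift
    xs = [x]
    for _ in range(steps):
        # Leapfrog (kick-drift-kick) step for H = (p + shift)^2 / (2m) + V(x).
        p -= 0.5 * dt * x**3
        x += dt * (p + shift) / m
        p -= 0.5 * dt * x**3
        xs.append(x)
    return xs

reference = trajectory(A=0.0)
with_gauge = trajectory(A=250.0)
max_deviation = max(abs(a - b) for a, b in zip(reference, with_gauge))
print(f"max |x_A(t) - x_0(t)| = {max_deviation:.2e}")
```

The two trajectories coincide up to floating-point roundoff: the constant A only shifts the canonical momentum and never enters the force.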

1.6.2 Quantum mechanics, open boundary conditions 

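Next we switch to quantum mechanics. It is expedient to rewrite Eq. (1.7) as

$$ H(\boldsymbol{\kappa}) = \frac{1}{2m} \left( \mathbf{p} + \hbar \boldsymbol{\kappa} \right)^2 + V(\mathbf{r}) , \qquad \boldsymbol{\kappa} = \frac{e}{\hbar c} \mathbf{A} , \tag{1.13} $$

where $\boldsymbol{\kappa}$, having the dimensions of an inverse length, will be referred to as
"twist" in the following. The Schrödinger equation is

$$ H(\boldsymbol{\kappa}) |\psi_n(\boldsymbol{\kappa})\rangle = \varepsilon_n(\boldsymbol{\kappa}) |\psi_n(\boldsymbol{\kappa})\rangle . \tag{1.14} $$

The eigenvectors and eigenvalues of the Schrödinger equation depend on the
boundary conditions assumed.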

The so-called open boundary conditions (OBCs) require that the bound
eigenstates are square-integrable over $\mathbb{R}^3$. Let $|\psi_n(0)\rangle$ be a nondegenerate
eigenstate of the "untwisted" Hamiltonian within OBCs. Then the state
$e^{-i \boldsymbol{\kappa} \cdot \mathbf{r}} |\psi_n(0)\rangle$ obviously obeys OBCs as well, and also obeys Eq. (1.14) with
a $\boldsymbol{\kappa}$-independent eigenvalue. Therefore it coincides with the n-th eigenstate
$|\psi_n(\boldsymbol{\kappa})\rangle$ of the twisted Hamiltonian; notice that this eigenstate is defined only up to
a $\boldsymbol{\kappa}$-dependent phase factor.

We conclude that a pure gauge within OBCs affects the wavefunction, but 
does not affect any of the observable quantities, such as expectation values, 
density, and current. We spell this, in jargon, by saying that the "twist" is 
easily "gauged away" within OBCs. 
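The step called "obvious" above can be spelled out in two lines. The following worked check uses only the symbols of the text, with $\mathbf{p} = -i\hbar\nabla$: the phase factor shifts the momentum by $-\hbar\boldsymbol{\kappa}$, which exactly cancels the twist.

```latex
\begin{align*}
(\mathbf{p} + \hbar\boldsymbol{\kappa})\, e^{-i\boldsymbol{\kappa}\cdot\mathbf{r}}\, \psi_n(0)
  &= e^{-i\boldsymbol{\kappa}\cdot\mathbf{r}}\, (\mathbf{p} - \hbar\boldsymbol{\kappa} + \hbar\boldsymbol{\kappa})\, \psi_n(0)
   = e^{-i\boldsymbol{\kappa}\cdot\mathbf{r}}\, \mathbf{p}\, \psi_n(0) , \\
H(\boldsymbol{\kappa})\, e^{-i\boldsymbol{\kappa}\cdot\mathbf{r}}\, \psi_n(0)
  &= e^{-i\boldsymbol{\kappa}\cdot\mathbf{r}}\, H(0)\, \psi_n(0)
   = \varepsilon_n(0)\, e^{-i\boldsymbol{\kappa}\cdot\mathbf{r}}\, \psi_n(0) .
\end{align*}
```

Since the phase factor has unit modulus, square-integrability is preserved, and the density is unchanged.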

1.6.3 Quantum mechanics, periodic boundary conditions 

We assume periodic boundary conditions (PBCs) over a cubic box of side L, i.e.
we require the eigenstates of Eq. (1.14) to be Born-von-Karman periodic with
period L over x, y, and z at any given $\boldsymbol{\kappa}$. Each Cartesian coordinate is therefore
equivalent to an angle, e.g. $\varphi_x = 2\pi x / L$.

If $|\psi_n(0)\rangle$ is an eigenstate of the untwisted Hamiltonian within PBCs, then
the state $e^{-i \boldsymbol{\kappa} \cdot \mathbf{r}} |\psi_n(0)\rangle$ obeys Eq. (1.14) with a $\boldsymbol{\kappa}$-independent eigenvalue, but
for a general $\boldsymbol{\kappa}$ it does not obey PBCs, and therefore in general does not coincide
with the genuine eigenstate $|\psi_n(\boldsymbol{\kappa})\rangle$. Within PBCs the spectrum of Eq. (1.14)
depends on the twist $\boldsymbol{\kappa}$ in a nontrivial way.

If $|\psi_n(\boldsymbol{\kappa})\rangle$ is an eigenstate of Eq. (1.14) within PBCs with eigenvalue $\varepsilon_n(\boldsymbol{\kappa})$,
then the auxiliary state $|\tilde{\psi}_n(\boldsymbol{\kappa})\rangle = e^{i \boldsymbol{\kappa} \cdot \mathbf{r}} |\psi_n(\boldsymbol{\kappa})\rangle$ obeys the untwisted ($\boldsymbol{\kappa} = 0$)
Schrödinger equation, and quasi-periodic (a.k.a. "twisted" or "skewed")
boundary conditions: at any two opposite faces of the cube the wavefunction
differs by a $\boldsymbol{\kappa}$-dependent phase factor.

In other words the problem can be formulated in two equivalent ways: either
the Hamiltonian is $\boldsymbol{\kappa}$-dependent, as in Eq. (1.14), and the boundary conditions
are $\boldsymbol{\kappa}$-independent; or the Hamiltonian is $\boldsymbol{\kappa}$-independent but the boundary
conditions are "twisted" in a $\boldsymbol{\kappa}$-dependent way.

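1.6.4 Example: Free particle in 1d

For the sake of simplicity, we consider Eq. (1.14) in 1d, and with V = 0:

$$ \frac{\hbar^2}{2m} \left( -i \frac{d}{dx} + \kappa \right)^2 |\psi_n(\kappa)\rangle = \varepsilon_n(\kappa) |\psi_n(\kappa)\rangle . \tag{1.15} $$

The eigenfunctions within PBCs and the spectrum are

$$ \langle x | \psi_n(\kappa) \rangle \propto e^{i 2\pi n x / L} , \qquad n \in \mathbb{Z} , \tag{1.16} $$

$$ \varepsilon_n(\kappa) = \frac{\hbar^2}{2m} \left( \frac{2\pi n}{L} + \kappa \right)^2 , \tag{1.17} $$

where the nontrivial $\kappa$-dependence is perspicuous. The velocity operator can be
written as

$$ v = \frac{1}{m} \left( p + \hbar \kappa \right) = \frac{1}{\hbar} \frac{\partial H(\kappa)}{\partial \kappa} , \tag{1.18} $$

and the Hellmann-Feynman theorem yields

$$ \langle \psi_n | v | \psi_n \rangle = \frac{1}{\hbar} \frac{d \varepsilon_n}{d \kappa} . \tag{1.19} $$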

We have introduced PBCs as a basic framework of condensed matter physics.
Many concepts (like the Bloch vector or the Fermi surface) make sense only
within PBCs. But we may also regard this problem as if the electrons were
confined to a circular rail of circumference L, as in Fig. 1.4. There is no field
(electric or magnetic) on the rail, but a constant vector potential of intensity
$A = c \hbar \kappa / e$ is present along the rail; eigenvalues and eigenfunctions depend on
its value.

Figure 1.4: The electron motion is confined to a circular rail. A constant vector
potential $A = c \hbar \kappa / e$ along the rail, as in Eq. (1.15), corresponds to vanishing
fields (electric and magnetic), yet the spectrum depends on the "inaccessible
flux" threading the surface encircled by the rail.
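The two properties of the free-particle spectrum stressed in this example can be checked in a few lines: the periodicity of the set of eigenvalues in the twist, and the Hellmann-Feynman relation between velocity and slope of the level. The sketch below uses atomic units; the ring circumference and the twist value are arbitrary illustrative numbers.

```python
import math

HBAR = 1.0   # atomic units
MASS = 1.0
L = 10.0     # hypothetical ring circumference (bohr)

def energy(n, kappa):
    """Level n of the free particle on the ring at twist kappa."""
    return HBAR**2 / (2.0 * MASS) * (2.0 * math.pi * n / L + kappa) ** 2

def velocity(n, kappa):
    """Expectation value of v = (p + hbar*kappa)/m on the plane wave n."""
    return HBAR * (2.0 * math.pi * n / L + kappa) / MASS

def lowest_levels(kappa, count=5, n_max=50):
    """Sorted lowest `count` eigenvalues at twist kappa."""
    return sorted(energy(n, kappa) for n in range(-n_max, n_max + 1))[:count]

kappa = 0.123
period = 2.0 * math.pi / L

# (a) The set of eigenvalues is periodic in kappa with period 2*pi/L.
levels_here = lowest_levels(kappa)
levels_shifted = lowest_levels(kappa + period)

# (b) Hellmann-Feynman: <v> equals (1/hbar) times the slope of the level.
h = 1e-6
n = 2
slope = (energy(n, kappa + h) - energy(n, kappa - h)) / (2.0 * h)
hf_velocity = slope / HBAR

print(levels_here)
print(levels_shifted)
print(hf_velocity, velocity(n, kappa))
```

Each individual level is not periodic: shifting the twist by one period maps level n onto level n + 1, and only the spectrum as a whole returns to itself.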

1.6.5 Flux and flux quantum

The constant vector potential A on the circular rail corresponds to a magnetic
flux $\phi = LA$ threading the surface encircled by the rail, in a region not visited
by the electronic system; it has been appropriately called by some authors
"inaccessible flux".

We further observe that the spectrum, Eq. (1.17), is periodic in $\kappa$ with period
$2\pi/L$; alternatively, it is periodic in the flux $\phi$ with period $\phi_0 = 2\pi\hbar c/e = hc/e$,
the elementary flux quantum. In cgs units $hc/e = 4.136 \times 10^{-7}$ gauss cm$^2$, while
in SI units $\phi_0 = h/e = 4.136 \times 10^{-15}$ Wb. Notice also that, in the framework
of superconductivity, the same symbol $\phi_0$ indicates one half of this (it refers to
electron pairs).
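The numerical values quoted above follow from the SI values of h and e, which are exact since the 2019 redefinition of the SI. The short sketch below evaluates the flux quantum in both unit systems; the variable names are illustrative.

```python
# Exact SI values of the Planck constant and of the elementary charge.
H_PLANCK = 6.62607015e-34    # J s
E_CHARGE = 1.602176634e-19   # C

WB_TO_GAUSS_CM2 = 1.0e8      # 1 Wb = 1e8 maxwell = 1e8 gauss cm^2

phi0_si = H_PLANCK / E_CHARGE           # flux quantum h/e, in Wb
phi0_cgs = phi0_si * WB_TO_GAUSS_CM2    # the same quantity in gauss cm^2
phi0_pairs = phi0_si / 2.0              # h/2e, as used in superconductivity

print(f"h/e  = {phi0_si:.4e} Wb = {phi0_cgs:.4e} gauss cm^2")
print(f"h/2e = {phi0_pairs:.4e} Wb")
```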

We stress that only the fractional part of the flux (in units of $\phi_0$) affects
the results in a nontrivial way. This is perspicuous if we recast Eq. (1.17) as

$$ \varepsilon_n(\phi) = \frac{\hbar^2}{2m} \left( \frac{2\pi}{L} \right)^2 \left( n + \frac{\phi}{\phi_0} \right)^2 . \tag{1.20} $$

The flux breaks time-reversal symmetry ($\kappa \to -\kappa$), and the spectrum is
nondegenerate, except when $\phi = 0$ or $\phi = \phi_0/2$, the latter also called "$\pi$ flux". In
these two cases (and in these cases only) the eigenfunctions can be chosen as
real.

When the flux is varied with time, an emf is induced along the loop. Using
Eq. (1.19), the current is

$$ I = -\frac{e}{L} \langle v \rangle = -c \, \frac{d \varepsilon_n}{d \phi} . \tag{1.21} $$

This result is remarkable: it holds even in the presence of a potential V(x), and
generalizes straightforwardly to N noninteracting electrons. It will be used in
the discussion of the quantum Hall effect: see Eq. (2.17) below.

Chapter 2



Early discoveries 



2.1 The Aharonov-Bohm effect: A paradox? 

The Aharonov-Bohm effect is the paradigm for a measurable effect induced by
an inaccessible flux. We anticipate that in many other phenomena such flux may
be purely "geometrical" or "topological", without any relationship to a genuine
magnetic field: this is, e.g., the case considered in the next Section. It is only
in the Aharonov-Bohm effect that one indeed addresses the inaccessible flux of
a magnetic field, as present e.g. inside a solenoid. An interference experiment
detects the presence of the flux even when the electronic motion is confined to
the region outside the solenoid, where the magnetic field is zero. This seems
paradoxical: something which "happens" in a region not visited by the quantum
particle may affect some observable properties. Indeed, the founding fathers of
quantum mechanics (in the 1920s) failed to notice such a peculiar feature. It only
surfaced more than 30 years afterwards in the milestone paper by Aharonov and
Bohm [20], which appeared in 1959 and whose abstract states verbatim "...contrary to the
conclusions of classical mechanics, there exist effects of potential on charged
particles, even in the regions where all the fields (and therefore the forces on the
particles) vanish".

The paper was shocking, and its conclusions were challenged by several authors;
nonetheless experimental validations appeared as early as 1960 [22, 23].
The main message of Ref. [20] is at the basis of many subsequent developments
in electronic structure theory, many of them illustrated below in the present
Notes. The Aharonov-Bohm effect is also at the root of the commercial SQUID
technology [24].

Figure 2.1: The Aharonov-Bohm interference experiment (from Ref. [21]).

It is remarkable that R. P. Feynman included the Aharonov-Bohm effect
in his legendary lectures, delivered to the sophomore class at Caltech during
the 1962-63 academic year [21]. In the final sentence about this topic, Feynman
says: "...E and B are slowly disappearing from the modern expression of physical
laws; they are being replaced by A and $\phi$".

It is also remarkable and shameful that a paper bearing the title "Nonexistence
of the Aharonov-Bohm effect" [25] was published as late as 1978. All
challenges disappeared with the publication in 1984 of the celebrated paper by
Michael Berry (now Sir Michael Berry [26]), where the eponymous phase made
its first appearance [27].

2.2 Conical intersections in molecules 

At the time Berry wrote his famous paper, only two occurrences of a geometrical 
phase (called Berry phase soon afterwards) in quantum mechanics were known to 
him: the Aharonov-Bohm effect and a somewhat exotic phenomenon occurring 
in molecular physics. Even the latter was known since the late 1950s [28, 29], 
and appropriately rebaptized in the late 1970s as "molecular Aharonov-Bohm 
effect" [30, 31]. In the subsequent years Berry phases were discovered in many 
branches of physics. 

The smallest molecular system where the molecular Aharonov-Bohm effect is
possible is a trimer, having three internal coordinates (e.g. the three internuclear
distances), and the simplest trimers are of course the homonuclear ones, where
symmetry plays a major role. I give a simple outline for this particular system:
a dynamical Jahn-Teller effect, bearing the conventional symmetry label $E \otimes e$.

We focus on a trimer of monovalent atoms, e.g. H$_3$ or Na$_3$, and we assume
an independent-electron picture in the Born-Oppenheimer approximation. We
start with the molecule in the equilateral configuration, Fig. 2.2. Two of the
valence electrons occupy a totally symmetric orbital, while the unpaired electron
occupies the next available one, which has E symmetry and is doubly degenerate.
In a simple tight-binding (alias minimal-basis LCAO) scheme, a possible
basis in the two-dimensional manifold is:

$$ |1\rangle = \frac{1}{\sqrt{2}} \left( |B\rangle - |C\rangle \right) ; \qquad |2\rangle = \frac{1}{\sqrt{6}} \left( 2|A\rangle - |B\rangle - |C\rangle \right) , \tag{2.1} $$

where A, B, C are atomic labels (as in the figure). This choice deserves an important
comment. We are adopting OBCs, as appropriate for an isolated molecule,
and the Hamiltonian is invariant under time reversal (no magnetic field, no
spin-orbit interaction). These two conditions guarantee that the orbitals may
always be chosen as real. They may, but they need not be: it may instead be
convenient to choose a complex basis in the same two-dimensional degenerate
manifold.
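Both statements, the degeneracy of the real doublet and the existence of an equivalent complex basis, can be checked on the smallest possible model. The sketch below assumes a nearest-neighbor tight-binding Hamiltonian with one orbital per atom and a hypothetical hopping amplitude T_HOP, whose sign is chosen so that the totally symmetric orbital is the lowest one.

```python
import cmath
import math

T_HOP = 1.0  # hypothetical hopping amplitude (arbitrary energy units)

# Tight-binding Hamiltonian in the atomic basis (A, B, C).
H = [[0.0, -T_HOP, -T_HOP],
     [-T_HOP, 0.0, -T_HOP],
     [-T_HOP, -T_HOP, 0.0]]

def apply(matrix, vec):
    return [sum(row[j] * vec[j] for j in range(3)) for row in matrix]

def inner(a, b):
    return sum(x.conjugate() * y for x, y in zip(a, b))

def rotate(vec):
    """Threefold rotation: the amplitudes move as A -> B -> C -> A."""
    return [vec[2], vec[0], vec[1]]

s2, s6 = math.sqrt(2.0), math.sqrt(6.0)
ket1 = [0.0, 1.0 / s2, -1.0 / s2]          # (|B> - |C>) / sqrt(2)
ket2 = [2.0 / s6, -1.0 / s6, -1.0 / s6]    # (2|A> - |B> - |C>) / sqrt(6)

# Both real orbitals are eigenvectors with the same eigenvalue +T_HOP.
residual1 = max(abs(h - T_HOP * k) for h, k in zip(apply(H, ket1), ket1))
residual2 = max(abs(h - T_HOP * k) for h, k in zip(apply(H, ket2), ket2))
overlap12 = inner(ket1, ket2)

# A complex combination in the same manifold: (|2> + i|1>) / sqrt(2).
ket_plus = [(b + 1j * a) / s2 for a, b in zip(ket1, ket2)]
rotation_eigenvalue = inner(ket_plus, rotate(ket_plus))

print(residual1, residual2, abs(overlap12))
print(rotation_eigenvalue, cmath.exp(-2j * math.pi / 3.0))
```

The complex orbital is an eigenstate of the threefold rotation, which no real orbital in the doublet can be; the price is that it is no longer invariant under complex conjugation.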




Figure 2.2: A homonuclear trimer in its equilateral configuration.

Figure 2.3: A schematic representation of a (counterclockwise) pseudorotation,
where subsequent snapshots differ by $2\pi/3$. The corresponding electronic
ground states in the tight-binding approximation are also shown.

When we distort the molecule from its equilateral configuration, the doublet
is linearly split: one of the two components is energetically favored, the molecule
undergoes Jahn-Teller distortion, and the electronic ground state in the
Born-Oppenheimer approximation becomes nondegenerate.

Next we analyze the motion of the nuclei. There are three linearly independent
normal modes for the small oscillations of the internal coordinates. Of
course, in the absence of a Jahn-Teller effect, the equilateral configuration is the
equilibrium one. One mode is totally symmetric, and cannot split the electronic
levels. The remaining two modes are degenerate, having in fact E symmetry, and
couple to the electronic doublet, thus giving rise to the dynamical Jahn-Teller
effect. The notation $E \otimes e$ indeed means that an E electronic state is coupled
to an e vibrational mode: conventionally, one uses upper-case letters as symmetry
labels for the electronic states, and lower-case ones for the vibrational modes.

The adiabatic electronic ground state follows the nuclear motion. For a
cyclic pseudorotation, shown in Fig. 2.3, the Hamiltonian is periodic, but the
electronic wavefunction is antiperiodic. The total wavefunction in the
Born-Oppenheimer approximation factors into the electronic one times the nuclear
one. Given that the total wavefunction must be single-valued, the nuclear
wavefunction too must be quantized using antiperiodic boundary conditions, and
this affects the pseudorotation spectrum in a measurable way.
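The sign change of the electronic wavefunction can be made explicit on a two-level model. The sketch below is an illustration under a stated assumption: the split doublet is described by the 2 x 2 real matrix xi1*sigma_z + xi2*sigma_x, a standard form of linear coupling that is not derived in the text. The phase accumulated along a closed loop is evaluated from the product of overlaps between neighboring ground states, which does not depend on the arbitrary sign of each eigenvector.

```python
import cmath
import math

def ground_state(xi1, xi2):
    """Real lower eigenvector of H = xi1*sigma_z + xi2*sigma_x (assumed model).

    The eigenvalues are -|xi| and +|xi|: a conical intersection at xi = 0.
    """
    r = math.hypot(xi1, xi2)
    # Two equivalent unnormalized solutions of (H + r) v = 0; keep the one
    # with the larger norm, which never vanishes for r > 0.
    cand_a = (xi2, -(xi1 + r))
    cand_b = (r - xi1, -xi2)
    v = max(cand_a, cand_b, key=lambda c: math.hypot(c[0], c[1]))
    norm = math.hypot(v[0], v[1])
    return (v[0] / norm, v[1] / norm)

def loop_phase(center, radius, n_points=400):
    """Discrete geometric phase of the ground state along a circular loop."""
    states = []
    for k in range(n_points):
        theta = 2.0 * math.pi * k / n_points
        states.append(ground_state(center[0] + radius * math.cos(theta),
                                   center[1] + radius * math.sin(theta)))
    product = 1.0
    for k in range(n_points):
        u, w = states[k], states[(k + 1) % n_points]
        product *= u[0] * w[0] + u[1] * w[1]
    return (-cmath.phase(complex(product, 0.0))) % (2.0 * math.pi)

phase_around = loop_phase(center=(0.0, 0.0), radius=1.0)  # encloses the degeneracy
phase_away = loop_phase(center=(3.0, 0.0), radius=1.0)    # does not enclose it
print(phase_around, phase_away)
```

Since the eigenvectors are real, the product of overlaps is a real number and the phase can only be 0 or pi: it is pi when the loop encircles the degeneracy point, meaning that the wavefunction changes sign, and 0 otherwise.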

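This feature has to do with the peculiar shape of the Born-Oppenheimer
surface, shown in Fig. 2.4. If we adopt a two-dimensional Cartesian normal
coordinate $\boldsymbol{\xi} = (\xi_1, \xi_2)$, the ionic displacements are parametrized as:

$$ \mathbf{u}_A = \boldsymbol{\xi} , \qquad \mathbf{u}_B = R(-2\pi/3) \, \boldsymbol{\xi} , \qquad \mathbf{u}_C = R(-4\pi/3) \, \boldsymbol{\xi} , $$

where $R(\theta)$ is the 2 x 2 matrix which rotates a vector by the angle $\theta$ in the
molecular plane.

Figure 2.4: The Born-Oppenheimer surface of the Jahn-Teller split doublet: a
double-valued function with a conical intersection. The potential minimum is
a circle of radius $\xi_{\rm min}$ centered at the degeneracy point.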

The meaning of this coordinate choice is transparent with reference to Fig. 2.3:
when the atom A is displaced by $\boldsymbol{\xi}$, the displacements of B and C are of equal
magnitude $|\boldsymbol{\xi}|$, but pointing in directions rotated by $-2\pi/3$ and $-4\pi/3$,
respectively. If we neglect Jahn-Teller coupling beyond linear order, no potential
energy is associated with a motion at constant $|\boldsymbol{\xi}|$, which is indeed a free
pseudorotation (or a "rotation wave"), also schematized in the succession of snapshots
in Fig. 2.3.

In the absence of Jahn-Teller coupling, the surface would simply be a paraboloid,
everywhere doubly degenerate. The linear Jahn-Teller splitting is a function of
$|\boldsymbol{\xi}|$, hence to linear order the electronic eigenvalues are:

$$ E_\pm(\boldsymbol{\xi}) \propto |\boldsymbol{\xi}|^2 \pm \mathrm{const} \, |\boldsymbol{\xi}| . \tag{2.2} $$

This double-valued function is displayed in Fig. 2.4 and has a conical intersection
at the origin. The lowest sheet $E_-(\boldsymbol{\xi})$ has a circular valley of radius
$\xi_{\rm min}$, where a classical particle travels freely (if nonlinear Jahn-Teller coupling
is neglected). Nothing exotic happens if the nuclear motion can be considered
as classical; but when we quantize the nuclear degrees of freedom, antiperiodic
boundary conditions have to be imposed for the cyclic motion, as said above.
A simple approximation for the rotovibrational levels is thus:

$$ E(u, j) = \left( u + \frac{1}{2} \right) \omega_0 + A j^2 , \tag{2.3} $$

corresponding to an oscillation of frequency $\omega_0$ and quantum number u, and
a two-dimensional internal rotation with rotor constant A. The antiperiodic
boundary conditions imply half-odd-integer values for the quantum number j.
The pseudorotation term in the spectrum can be compared to Eq. (1.20); the
moment of inertia in the prefactor obviously becomes that of a nuclear rotor, but the
spectrum is the same if we identify the inaccessible flux $\phi$ with half a flux
quantum $\phi_0/2$ (a.k.a. $\pi$ flux).

There is no magnetic field in this problem; the flux is purely topological
and can be regarded as an obstruction: the nuclear path cannot be contracted
without crossing a degeneracy point. It is remarkable that the topological nature
of this problem was clearly stated as early as 1963 (well before topology
became fashionable in electronic structure) by Herzberg and Longuet-Higgins
[29], who say verbatim: "...a conically self-intersecting potential surface has a
different topological character from a pair of distinct surfaces which happen to
meet at a point. Indeed, if an electronic wave function changes sign when we
move round a closed loop in configuration space, we can conclude that somewhere
inside the loop there must be a singular point at which the wave function is
degenerate".

In modern jargon, we would say that the cases $\phi = 0$ and $\phi = \phi_0/2$ are
topologically distinct; owing to time-reversal invariance, other flux values are ruled
out. The present paradigm also illustrates the robustness of topological properties
against smooth deformations. For instance, here we have addressed the
ultrasimple tight-binding model, but the ground wavefunction can be "continuously
deformed" to the exact correlated wavefunction: topology-wise, the two
wavefunctions are essentially the same object, insofar as the conical intersection
is present. Notice also that at the conical intersection the Born-Oppenheimer
approximation breaks down.

One could also address more general closed paths, according to their winding
number around the obstruction. Only paths having the same winding number
can be continuously deformed into each other: they are "homotopic".
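For a path in the plane the winding number around a point obstruction is easily computed by accumulating the angle swept by the position vector. The sketch below assumes a closed polygonal path and places the obstruction at the origin; the sample circles are hypothetical paths chosen for illustration.

```python
import math

def winding_number(path):
    """Winding number of a closed polygonal path around the origin.

    `path` is a list of (x, y) points; the last point is joined to the first.
    The path must not pass through the origin (the obstruction).
    """
    total = 0.0
    for k in range(len(path)):
        x0, y0 = path[k]
        x1, y1 = path[(k + 1) % len(path)]
        # Signed angle between consecutive position vectors, in (-pi, pi].
        total += math.atan2(x0 * y1 - y0 * x1, x0 * x1 + y0 * y1)
    return round(total / (2.0 * math.pi))

def circle(center, radius, turns=1, n_points=360):
    """Sample path: a circle traversed `turns` times (negative = clockwise)."""
    return [(center[0] + radius * math.cos(2.0 * math.pi * turns * k / n_points),
             center[1] + radius * math.sin(2.0 * math.pi * turns * k / n_points))
            for k in range(n_points)]

w_once = winding_number(circle((0.0, 0.0), 1.0))
w_twice = winding_number(circle((0.2, -0.1), 1.0, turns=2))
w_none = winding_number(circle((3.0, 0.0), 1.0))
w_reverse = winding_number(circle((0.0, 0.0), 1.0, turns=-1))
print(w_once, w_twice, w_none, w_reverse)
```

The result is an integer that cannot change under any deformation of the path which avoids the origin, which is precisely the homotopy statement made above.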

2.3 Quantization of the surface charge 

The pioneering self-consistent calculations of the electronic structure of surfaces,
performed at IBM (Yorktown Heights) and at Bell Labs in the mid 1970s, 
pointed out the occurrence of quantization of charge at insulating surfaces. 
After an early paper by V. Heine in 1966 [32], the theorem made its appearance 
in a 1974 paper by Appelbaum and Hamann [33]. Other papers addressed the 
issue in the 1970s [34, 35], but the topological explanation came much later; it 
will be discussed below, Sec. 5.3.4. 

The quantization of surface charge may appear counterintuitive, if one sticks
to the idea that a solid is an array of classical charges (ions), as many people
still do. Possibly because of its counterintuitive content, this important theorem 
is surprisingly ignored even by well known specialists in surface physics. The 
extreme of such ignorance occurs in a recently published invited review paper — 
which I abstain from quoting — about polar surfaces. The theorem is even more 
ignored in quantum chemistry, where it addresses end charges in linear polymers. 

Electrons are quantum particles, and classical ideas may prove wrong. Solids 
are not assemblies of ions; they are assemblies of atoms, having ionic character 
only because neighboring atoms have a different electronegativity [36]. At the 
surface, one has to look at what happens to the bonds. 

A simple statement of the theorem is the following. If the bulk of the crystal 
is centrosymmetric, and if the surface is insulating, then the charge per surface 
cell may only be an integer or half integer; the surface charge can be nonquan- 
tized only if the bulk is noncentrosymmetric, or if the surface is metallic. 

Quite often, the actual quantized value is zero because of energy consider- 
ations; therefore even polar surfaces are (counterintuitively) neutral under the 
above two essential hypotheses, which I stress again: the bulk is centrosymmet- 
ric, and the surface is insulating. The microscopic mechanism can be understood 
as an intrinsic surface-state neutralization [36]; however, topology guarantees 
quantization independently of microscopic details. 

We nowadays regard bulk-surface correspondence in many phenomena as 
one of the hallmarks of geometry and topology in condensed matter physics. In 
modern jargon, I would say that the surface charge of insulators is "topologically 
protected" . 

2.4 Integer quantum Hall effect 
2.4.1 Classical theory (Drude-Zener) 

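We consider any 2d system, in the setup shown in Fig. 2.5. If dissipation is
accounted for by a single relaxation time $\tau$, the Newton equation of motion for
a single carrier of mass m and charge $-e$ is

$$ m \frac{d\mathbf{v}}{dt} = -e \left( \mathbf{E} + \frac{1}{c} \mathbf{v} \times \mathbf{B} \right) - \frac{m}{\tau} \mathbf{v} . \tag{2.4} $$

Setting $d\mathbf{v}/dt = 0$ we get the steady-state solution:

$$ \mathbf{v} = -\frac{e\tau}{m} \left( \mathbf{E} + \frac{1}{c} \mathbf{v} \times \mathbf{B} \right) . \tag{2.5} $$

In terms of the cyclotron frequency $\omega_c = eB/(mc)$ the solution with $E_y = 0$ is

$$ v_x = -\frac{e\tau}{m} E_x - \omega_c \tau \, v_y , \qquad v_y = \omega_c \tau \, v_x . \tag{2.6} $$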
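If n is the carrier density the current is $\mathbf{j} = -ne\mathbf{v}$:

$$ j_x = \frac{ne^2\tau}{m} E_x - \omega_c \tau \, j_y , \qquad j_y = \omega_c \tau \, j_x . \tag{2.7} $$

In zero B field we retrieve the standard Drude (diagonal) conductivity:

$$ j_x = \sigma_0 E_x , \qquad \sigma_0 = \frac{ne^2\tau}{m} , \tag{2.8} $$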



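while for $B \neq 0$ the conductivity tensor is:

$$ \mathbf{j} = \sigma \mathbf{E} , \qquad \sigma_{xx} = \frac{\sigma_0}{1 + (\omega_c \tau)^2} , \qquad \sigma_{yx} = -\sigma_{xy} = \frac{\omega_c \tau \, \sigma_0}{1 + (\omega_c \tau)^2} . \tag{2.9} $$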



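Inversion of the conductivity tensor

$$ \rho_{xx} = \frac{\sigma_{xx}}{\sigma_{xx}^2 + \sigma_{yx}^2} , \qquad \rho_{xy} = \frac{\sigma_{yx}}{\sigma_{xx}^2 + \sigma_{yx}^2} \tag{2.10} $$

provides a remarkably simple expression for the longitudinal and transverse
resistivity

$$ \rho_{xx} = 1/\sigma_0 = \frac{m}{ne^2\tau} , \qquad \rho_{xy} = \frac{m \omega_c}{ne^2} = \frac{1}{nec} B . \tag{2.11} $$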

The Hall resistivity is therefore linear in $B$ and independent of both mass and
relaxation time; more accurately, since we may consider even carriers of positive
charge $e$ ("holes"), its sign does depend on the carrier charge. Notice also that
in the nondissipative regime ($\tau \gg 1/\omega_c$) both $\rho_{xx}$ and $\sigma_{xx}$ vanish.
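
The inversion leading to Eqs. (2.10) and (2.11) can be checked numerically. The following is a minimal sketch, assuming dimensionless units ($n = e = m = c = 1$ by default); the function names `drude_conductivity` and `resistivity` are illustrative choices, not part of the Notes.

```python
def drude_conductivity(n, tau, b_field, m=1.0, e=1.0, c=1.0):
    """Return (sigma_xx, sigma_yx) of the Drude-Zener model, Eqs. (2.8)-(2.9)."""
    sigma0 = n * e**2 * tau / m              # zero-field Drude conductivity
    wc_tau = e * b_field / (m * c) * tau     # omega_c * tau
    denom = 1.0 + wc_tau**2
    return sigma0 / denom, wc_tau * sigma0 / denom


def resistivity(sigma_xx, sigma_yx):
    """Invert the 2x2 conductivity tensor, Eq. (2.10)."""
    det = sigma_xx**2 + sigma_yx**2
    return sigma_xx / det, sigma_yx / det


n, tau, b_field = 1.0, 2.0, 3.0
sigma_xx, sigma_yx = drude_conductivity(n, tau, b_field)
rho_xx, rho_xy = resistivity(sigma_xx, sigma_yx)

# Expected from Eq. (2.11): rho_xx = m/(n e^2 tau), rho_xy = B/(n e c)
print(rho_xx, rho_xy)

# The Hall resistivity does not depend on the relaxation time
rho_xy_other_tau = resistivity(*drude_conductivity(n, 7.5, b_field))[1]
```

Changing `tau` alters `rho_xx` but leaves `rho_xy` untouched, which is the statement made after Eq. (2.11).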
Figure 2.5: Hall effect in a 2d system. The E field is applied along x, while the
B field is along z. The system is shorted in the y direction; the current j has both
longitudinal (x) and transverse, a.k.a. Hall (y) components.

If we write $n$ as $N/A$ (number of carriers per unit area), then

$$ \rho_{xy} = \frac{B}{nec} = \frac{\phi}{Nec}, \tag{2.12} $$

where $\phi = AB$ is the magnetic flux through area $A$. Although we are still at a
purely classical level, it is instructive to multiply and divide $\rho_{xy}$ by $h$. We may
thus identically write

$$ \rho_{xy} = \frac{1}{N}\,\frac{\phi}{\phi_0}\,\frac{h}{e^2} = \frac{1}{\nu}\,\frac{h}{e^2}. \tag{2.13} $$

Here $\phi_0 = hc/e$ is the flux quantum, as defined above. The dimensionless
quantity $\nu$, called the filling factor, equals the ratio between the number of
electrons $N$ and the number of flux quanta $\phi/\phi_0$. Eq. (2.13) expresses the
transverse resistivity in terms of the natural resistance unit $h/e^2$. Since 1990
this is a new metrology standard, accurate to more than nine figures: 1 klitzing
= $h/e^2$ = 25812.807557(18) ohm.

In 2d resistance and resistivity have the same dimensions, and coincide in
the transverse case. We write therefore the Hall resistance as

$$ V_y/I_x = R_H = R_{xy} = \frac{1}{\nu}\,\frac{h}{e^2}. \tag{2.14} $$

Upon obvious dimensionality arguments, even in the quantum case the Hall
resistance can be written in this way; but then the concentration- and
$B$-dependence of $\nu$ are expected to be very different from the simple monotonic
form of Eq. (2.13).



2.4.2 Landau levels 

In quantum mechanics, the Schrödinger equation for an electron in 2d subject
to a perpendicular B field (and in a flat potential) can be exactly dealt with,
both in the Landau gauge and in the central gauge. The spectrum is discrete:
$\epsilon_n = (n + \frac{1}{2})\hbar\omega_c$, the Landau levels (LLs). We define the magnetic
length as $\ell = (\hbar c/eB)^{1/2}$; it diverges in the zero-field limit, and is of the
order of 100 angstrom in a typical quantum Hall experiment. In the Landau gauge
($A_x = By$, $A_y = 0$) the eigenfunctions with energy $\epsilon_n$ are

$$ \psi_{nk}(x,y) \propto e^{ikx}\,\chi_n(y - \ell^2 k), \tag{2.15} $$

where $\chi_n(y)$ are harmonic oscillator eigenfunctions with frequency $\omega_c$. Each LL
is infinitely degenerate (one eigenfunction for each $k$). For a system of area $A$,
the number of states in each level is $\mathcal{N} = A/(2\pi\ell^2)$; this has a simple form in
terms of the magnetic flux $\phi$ through area $A$: $\mathcal{N} = \phi/\phi_0$.

If we now consider $N$ noninteracting electrons, the lowest LL is completely
filled when $N = \mathcal{N}$; more generally, one expects a periodicity in the filling factor
$\nu = N/\mathcal{N} = N\phi_0/\phi$, whenever $\nu$ crosses an integer value, in most physical
properties.
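
To get a feeling for the orders of magnitude, the magnetic length and the degeneracy per unit area can be evaluated numerically. The sketch below is an illustration only: it assumes SI units, in which the magnetic length reads $\sqrt{\hbar/(eB)}$ (the Gaussian expression in the text carries an extra factor $c$), and the function names are hypothetical.

```python
import math

H_PLANCK = 6.62607015e-34    # Planck constant, J s (exact in the 2019 SI)
E_CHARGE = 1.602176634e-19   # elementary charge, C (exact in the 2019 SI)
HBAR = H_PLANCK / (2.0 * math.pi)


def magnetic_length(b_tesla):
    """Magnetic length in metres, SI form sqrt(hbar / (e B))."""
    return math.sqrt(HBAR / (E_CHARGE * b_tesla))


def states_per_area(b_tesla):
    """States per Landau level and per unit area, 1/(2 pi l^2), in m^-2."""
    return 1.0 / (2.0 * math.pi * magnetic_length(b_tesla) ** 2)


def filling_factor(density_m2, b_tesla):
    """Filling factor nu = N / (number of states per level)."""
    return density_m2 / states_per_area(b_tesla)


ell = magnetic_length(10.0)           # B = 10 tesla
print(ell * 1e10, "angstrom")         # of the order of 100 angstrom
print(states_per_area(10.0), "states per m^2 in each level")
```

At 10 tesla the magnetic length comes out slightly below 100 angstrom, in line with the estimate quoted above; `states_per_area` equals $B/\phi_0$ with the SI flux quantum $h/e$.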



2.4.3 The experiment 

The Hall resistance of a noninteracting 2d electron gas was computed
quantum-mechanically by Ando in 1974 [38]. The result, when expressed as in
Eq. (2.14), showed indeed oscillations in $\nu$ with integer period. The experiment,
performed by von Klitzing and collaborators in 1980 [37], provided qualitatively
different and very surprising results, shown in Fig. 2.6. The discovery of the
quantum Hall effect triggered a revolution with far-reaching consequences in
electronic structure theory at large; Klaus von Klitzing was awarded the Nobel
prize in 1985.

Figure 2.6: The original figure from von Klitzing et al., Ref. [37]. The gate
voltage $V_g$ was supposed to control the carrier density. Instead, the Hall
resistance is quantized and insensitive to $V_g$ over a large interval; over the
same interval, the longitudinal resistance vanishes.



In the original experiment, the 2d electrons were confined by a MOSFET
(metal-oxide-semiconductor field-effect transistor); later, higher mobilities were
obtained at semiconductor heterojunctions. Fig. 2.6 shows a very robust
plateau, where $R_{xx} = 0$ and $R_{xy} = 6453.3 \pm 0.1$ ohm, corresponding to the
filling factor $\nu = 4$. The accuracy in the quantized $R_{xy}$ value is clearly far beyond
the experimental control of the carrier concentration and of the B field
uniformity over the sample. A novel, qualitatively different, state of matter was
discovered. In modern jargon, the plateaus are "topologically protected".
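
The plateau value can be compared with Eq. (2.14) by direct evaluation of $h/e^2$. This is a minimal sketch, assuming the exact constants of the 2019 SI (which differ marginally, beyond the seventh significant figure, from the 1990 conventional value quoted earlier); `hall_plateau` is an illustrative name.

```python
H_PLANCK = 6.62607015e-34    # Planck constant, J s (exact in the 2019 SI)
E_CHARGE = 1.602176634e-19   # elementary charge, C (exact in the 2019 SI)

R_K = H_PLANCK / E_CHARGE**2   # natural resistance unit h/e^2, in ohm


def hall_plateau(nu):
    """Hall resistance (1/nu) h/e^2 on the plateau with integer filling nu."""
    return R_K / nu


print(R_K)              # about 25812.807 ohm
print(hall_plateau(4))  # to be compared with the measured 6453.3 +/- 0.1 ohm
```

The computed $\nu = 4$ value falls within the error bar of the 1980 measurement.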

A modern realization of the integer quantum Hall effect is shown in Fig.
2.7, where $\rho_{xx}$ and $\rho_{xy}$ are plotted as a function of the magnetic field. The
plateau quantization is accurate to nine figures. The 2d electron gas is typically
confined at a GaAs/GaAlAs heterojunction. The $\nu = 1$ value is achieved above
$\sim 10$ tesla; at low field (high $\nu$) the system becomes dissipative ($\rho_{xx} > 0$), while
the classical linear behavior of $\rho_{xy}$ is recovered; the slope depends on electron
concentration $n$.

Figure 2.7: A modern realization of the integer quantum Hall effect.



2.4.4 Early theoretical interpretation 

The breakthrough paper, by Laughlin, appeared as early as 1981 [39]. This
is a remarkably concise paper (two pages) which, in retrospect, is based on
topological arguments. One key ingredient of the theory is disorder: in fact, the
quantum Hall effect becomes less spectacular for very "clean" samples, while
some "dirtiness" enhances the effect.

Laughlin devised a Gedankenexperiment based on the setup shown in
Fig. 2.8. The corresponding 2d Schrödinger Hamiltonian in the Landau gauge
is

$$ H(\varphi) = \frac{1}{2m}\left(\mathbf{p} + \frac{e}{c}\,\mathbf{A}\right)^2 + eEy + \mathcal{V}(x,y), \tag{2.16} $$

where $E$ is the field across the ribbon, and $\mathcal{V}$ is an arbitrary substrate potential.
The addition of a constant vector potential along $x$, $A_x \to A_x + \Delta A$, in Eq. (2.16)
corresponds to threading a flux $\varphi = L\,\Delta A$ through the cylinder; we use the
symbol $\varphi$, not to be confused with the real magnetic flux $\phi$ normal to the
surface.

Similarly to what was discussed in Sec. 1.6, the eigenvalues acquire a $\varphi$
dependence. According to Eq. (1.21), if $\epsilon_n(\varphi)$ is the n-th eigenvalue, the current
transported by the corresponding eigenstate is $I_x = -c\,d\epsilon_n(\varphi)/d\varphi$. For an
independent-particle system with $N$ carriers the current is thus

$$ I_x = -c\sum_{n=1}^{N}\frac{d\epsilon_n(\varphi)}{d\varphi} = -c\,\frac{dU(\varphi)}{d\varphi}, \tag{2.17} $$

where $U(\varphi)$ is the total energy of the system. Implicitly, we are assuming a
dissipationless system.

The expression in Eq. (2.17) for the current is remarkably simple, general,
and robust: it does not depend on the substrate potential $\mathcal{V}(x,y)$, nor on the
number $N$ of carriers, and not even on their mass $m$. But for a disordered
potential the eigenstates come in two kinds: localized and extended. The latter
ones are phase-coherent round the loop, while the former are exponentially
localized for $L \to \infty$. The localized states are insensitive to the flux insertion
(like the OBCs eigenstates in Sec. 1.6), and the whole current is carried by
the delocalized ones. Therefore Eq. (2.17) provides a nonzero result insofar as
at least one of the occupied eigenstates in the disordered sample is extended,
i.e. phase-coherent round the loop; besides this, the number and nature of the
current-carrying states are irrelevant. It is therefore crucial to address the nature
of the single-particle eigenstates in a quantum Hall sample. For a clean sample
(flat substrate) the LLs are sharp, all eigenstates are extended (in the Landau
gauge), and the density of states is a series of delta functions, shown in
Fig. 2.9(a); the weight of each delta is $\phi/\phi_0$. In presence of disorder, the deltas
broaden into alternating bands of localized and extended states, as sketched in
Fig. 2.9(b).

Figure 2.8: Geometry for Laughlin's Gedankenexperiment. A 2d channel is
bent into a loop of circumference $L$, and a magnetic B field of constant
magnitude pierces the cylinder normal to the surface. A current $I = I_x$ circles the
loop; $V_H = V_y$ is the Hall voltage. The loop may be threaded by the inaccessible
flux $\varphi$.

Figure 2.9: The density of states for a 2d system of noninteracting carriers. (a)
Clean system, with zero substrate potential. (b) Actual sample, in presence of
substrate disorder and impurities.

The electron fluid is in the quantum Hall regime whenever the Fermi level
falls in a region of localized states. Therefore $\sigma_{xx} = 0$ (the fluid is a "quantum
Hall insulator"), and $\rho_{xx} = 0$ (transport is dissipationless).

We now imagine adiabatically increasing the vector potential by an amount
$\Delta A = \phi_0/L$, where $\phi_0$ is a flux quantum: all of the current-carrying states are
mapped back into themselves, while the localized ones are unaffected. Hence the
ground state has the same energy; nonetheless Eq. (2.17) implies
$U(\varphi + \phi_0) - U(\varphi) \simeq -\phi_0 I_x/c \neq 0$. This is only possible if an integer number of
electrons is transferred from one cylinder edge to the other, each of them
contributing the energy $eV_y$. If we call $-\nu$ such integer number, the relationship
is then

$$ \phi_0 I_x/c = \nu e V_y; \qquad R_H = \frac{V_y}{I_x} = \frac{\phi_0}{\nu c e} = \frac{1}{\nu}\,\frac{h}{e^2}. \tag{2.18} $$

The flux $\varphi$ acts therefore as a charge pump; the pump cycle is one flux quantum
$\phi_0$.

Ideally, the sample ground state can be continuously "deformed" from dirty
to clean. Insofar as the Fermi level stays in a region of nonconducting states,
the (topological) integer $\nu$ cannot change, even if the number of current-carrying
states does obviously change. The identification of $\nu$ with the number of
filled LLs comes from the clean-sample limit, which is exactly soluble. Setting
$\mathcal{V}(x,y) = 0$, the eigenfunctions of Eq. (2.16) are

$$ \psi_{nk}(x,y) = e^{ikx}\,\chi_n(y - y_0), \qquad y_0 = \ell^2 k - \frac{cE}{\omega_c B}. \tag{2.19} $$

For a finite $L$, the allowed $k$'s are integer multiples of $2\pi/L$ and the
corresponding centers $y_0$ are spaced by $2\pi\ell^2/L = L\phi_0/\phi$. Threading a flux $\varphi$ shifts $y_0$
linearly in $\varphi$; when $\varphi$ equals one flux quantum each eigenfunction goes over to
the next. Therefore one carrier is shifted for each $n$; the integer index $\nu$ thus
measures the number of occupied LLs. Similar arguments can be reformulated
in different gauges and in different geometries [40, 41].

Chapter 3
Berry-ology



Berry-ology is used as a synonym for geometry in nonrelativistic quantum
mechanics. Topological (i.e. quantized) quantities are defined via geometrical
quantities, analogously to what we did above in Sec. 1.2. But many important
quantities (notably the Berry phase) are merely geometrical.

Let us assume that a generic time-independent quantum Hamiltonian has a
parametric dependence. The Schrödinger equation is

$$ H(\boldsymbol{\xi})\,|\Psi(\boldsymbol{\xi})\rangle = E(\boldsymbol{\xi})\,|\Psi(\boldsymbol{\xi})\rangle, \tag{3.1} $$

where the $d$-dimensional real parameter $\boldsymbol{\xi}$ is defined in a suitable domain of $\mathbb{R}^d$:
a 2d $\boldsymbol{\xi}$ has been chosen for display in Fig. 3.1. In most of this Section we discuss
the most general case, and therefore we do not specify which quantum system is
described by this Hamiltonian, nor what the physical meaning of the parameter
$\boldsymbol{\xi}$ is.

In the subsequent Chapters $|\Psi(\boldsymbol{\xi})\rangle$ will be identified with either a single-
particle wavefunction (a.k.a. orbital) or a many-electron wavefunction. As for
the parameter $\boldsymbol{\xi}$, it may be a nuclear coordinate, a phase angle, a magnetic
flux, a Bloch vector, a momentum, and more: it could therefore have various
dimensions. Sometimes, the parameter $\boldsymbol{\xi}$ is called the "slow variable", while the
electronic coordinates are the "fast variable". At the end of this Chapter, Sec.
3.9, we address the special case where the parameter $\boldsymbol{\xi}$ is identified with a Bloch
vector.

The state vectors $|\Psi(\boldsymbol{\xi})\rangle$ are all supposed to be normalized and to reside
in the same Hilbert space: this amounts to saying that the wavefunctions are
supposed to obey $\boldsymbol{\xi}$-independent boundary conditions. We focus on the ground
state $|\Psi_0(\boldsymbol{\xi})\rangle$, and we assume it to be nondegenerate for $\boldsymbol{\xi}$ in some domain of
$\mathbb{R}^d$.

Any quantum mechanical state vector is arbitrary by a constant phase factor.
Here we refer to choosing this phase as the choice of the gauge. The
terminology is somewhat ambiguous. In presence of magnetic fields, we may change
the magnetic gauge: this changes the Hamiltonian and the eigenfunctions. Once
the magnetic gauge (hence the Hamiltonian) is fixed, we still remain with the
phase arbitrariness referred to above. All measurable quantities (e.g. expectation
values) are obviously gauge-invariant (in both senses), but the reverse is
also true: all gauge-invariant properties are, at least in principle, measurable.

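It is expedient to define the ground-state projector (a.k.a. density matrix)
and its complement, i.e.

$$ P(\boldsymbol{\xi}) = |\Psi_0(\boldsymbol{\xi})\rangle\langle\Psi_0(\boldsymbol{\xi})|; \qquad Q(\boldsymbol{\xi}) = 1 - P(\boldsymbol{\xi}). \tag{3.2} $$

Both $P(\boldsymbol{\xi})$ and $Q(\boldsymbol{\xi})$ are gauge-invariant (for a fixed Hamiltonian).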

3.1 Phases and distances 

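We define the phase difference between the ground eigenstates at two different
$\boldsymbol{\xi}$ points in the most natural way:

$$ e^{-i\Delta\varphi_{12}} = \frac{\langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle}{|\langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle|}; \tag{3.3} $$

$$ \Delta\varphi_{12} = -\mathrm{Im}\,\log\,\langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle. \tag{3.4} $$

For any given choice of the two states, Eqs. (3.3) and (3.4) provide a $\Delta\varphi_{12}$
which is unique modulo $2\pi$, except in the very special case that the states are
orthogonal. However, it is also clear that such $\Delta\varphi_{12}$ is gauge-dependent and
cannot have, by itself, any physical meaning.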

The distance between quantum states has been defined by Bures [42] as:

$$ D_{12}^2 = 1 - |\langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle|^2. \tag{3.5} $$

Such distance fulfills the familiar axioms from calculus textbooks; it vanishes
when the two states physically coincide (i.e. independently of the phase factor),
and is maximum (equal to one) when the states are orthogonal. At variance
with $\Delta\varphi_{12}$ defined above, the Bures distance is gauge-invariant, and can be
explicitly expressed in terms of ground-state projectors

$$ D_{12}^2 = 1 - \mathrm{Tr}\,\{P(\boldsymbol{\xi}_1)P(\boldsymbol{\xi}_2)\}, \tag{3.6} $$

where "Tr" is the trace over the Hilbert space.
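
The different behavior of the phase difference and of the Bures distance under a gauge change can be seen in a two-component toy example. The sketch below assumes, purely for illustration, that the states are spin-1/2 spinors labeled by two angles; the helper names are hypothetical and do not come from the Notes.

```python
import cmath
import math


def spinor(theta, phi):
    """Normalized two-component state pointing along the direction (theta, phi)."""
    return [complex(math.cos(theta / 2)), cmath.exp(1j * phi) * math.sin(theta / 2)]


def overlap(a, b):
    """Scalar product <a|b>."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def phase_difference(a, b):
    """Delta phi_12 = -Im log <a|b>, Eq. (3.4), on the principal branch."""
    return -cmath.phase(overlap(a, b))


def bures_distance_sq(a, b):
    """D_12^2 = 1 - |<a|b>|^2, Eq. (3.5)."""
    return 1.0 - abs(overlap(a, b)) ** 2


psi1 = spinor(0.3, 0.0)
psi2 = spinor(1.1, 0.8)

# Gauge change: multiply the second state by an arbitrary phase factor
chi = 0.7
psi2_regauged = [cmath.exp(1j * chi) * c for c in psi2]

dphi = phase_difference(psi1, psi2)
dphi_regauged = phase_difference(psi1, psi2_regauged)
dist = bures_distance_sq(psi1, psi2)
dist_regauged = bures_distance_sq(psi1, psi2_regauged)

print(dphi_regauged - dphi)   # shifted by -chi: gauge-dependent
print(dist_regauged - dist)   # unchanged (up to rounding): gauge-invariant
```

Two orthogonal spinors, e.g. `spinor(0.0, 0.0)` and `spinor(math.pi, 0.0)`, are at the maximum distance of one.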

3.2 Berry phase 

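We have already observed that the phase difference $\Delta\varphi_{12}$ between any two states
is gauge-dependent and cannot have any physical meaning. Matters are quite
different when we consider the total phase difference along a closed path which
joins several points in a given order, as shown in Fig. 3.2:

$$ \gamma = \Delta\varphi_{12} + \Delta\varphi_{23} + \Delta\varphi_{34} + \Delta\varphi_{41} = -\mathrm{Im}\,\log\,\langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle\langle\Psi_0(\boldsymbol{\xi}_2)|\Psi_0(\boldsymbol{\xi}_3)\rangle\langle\Psi_0(\boldsymbol{\xi}_3)|\Psi_0(\boldsymbol{\xi}_4)\rangle\langle\Psi_0(\boldsymbol{\xi}_4)|\Psi_0(\boldsymbol{\xi}_1)\rangle. \tag{3.7} $$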




Figure 3.1: State vectors in the two-dimensional $\boldsymbol{\xi}$-space. The phase
difference between two of them is defined as
$e^{-i\Delta\varphi_{12}} = \langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle / |\langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle|$
and their distance as $D_{12}^2 = 1 - |\langle\Psi_0(\boldsymbol{\xi}_1)|\Psi_0(\boldsymbol{\xi}_2)\rangle|^2$.

Figure 3.2: A closed path joining four states in $\boldsymbol{\xi}$-space.



It is now clear that all the gauge-arbitrary phases cancel in pairs, such as to
make the overall phase $\gamma$ a gauge-invariant quantity. The above simple-minded
algebra leads to a result of overwhelming physical importance: in fact, a gauge-
invariant quantity is potentially a physical observable. In essence, this is the
revolutionary message of Berry's celebrated paper, which appeared in 1984 [27, 43].

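Next we consider a smooth closed curve $C$ in the parameter domain, such
as in Fig. 3.3, and we discretize it with a set of points on it. Using Eq. (3.3),
we write the phase difference between any two contiguous points as

$$ e^{-i\Delta\varphi} = \frac{\langle\Psi_0(\boldsymbol{\xi})|\Psi_0(\boldsymbol{\xi}+\Delta\boldsymbol{\xi})\rangle}{|\langle\Psi_0(\boldsymbol{\xi})|\Psi_0(\boldsymbol{\xi}+\Delta\boldsymbol{\xi})\rangle|}. \tag{3.8} $$

If we further assume that the gauge is so chosen that the phase varies in a
differentiable way along the path, then from Eq. (3.8) we get to leading order
in $\Delta\boldsymbol{\xi}$:

$$ -i\,\Delta\varphi \simeq \langle\Psi_0(\boldsymbol{\xi})|\nabla_{\boldsymbol{\xi}}\Psi_0(\boldsymbol{\xi})\rangle\cdot\Delta\boldsymbol{\xi}. \tag{3.9} $$

In the limiting case of a set of points which becomes dense on the continuous
path, the total phase difference $\gamma$ converges to a circuit integral:

$$ \gamma = \sum_{s=1}^{M}\Delta\varphi_{s,s+1} \;\to\; \oint_C \mathcal{A}(\boldsymbol{\xi})\cdot d\boldsymbol{\xi}, \tag{3.10} $$

where $\mathcal{A}(\boldsymbol{\xi})$ is called the Berry connection:

$$ \mathcal{A}(\boldsymbol{\xi}) = i\,\langle\Psi_0(\boldsymbol{\xi})|\nabla_{\boldsymbol{\xi}}\Psi_0(\boldsymbol{\xi})\rangle. \tag{3.11} $$

Figure 3.3: A smooth closed curve $C$ in $\boldsymbol{\xi}$-space, and its discretization.

Since the state vectors are assumed to be normalized at any $\boldsymbol{\xi}$, the connection
is real; we can therefore equivalently write

$$ \mathcal{A}(\boldsymbol{\xi}) = -\mathrm{Im}\,\langle\Psi_0(\boldsymbol{\xi})|\nabla_{\boldsymbol{\xi}}\Psi_0(\boldsymbol{\xi})\rangle. \tag{3.12} $$

A number of manifestations of the Berry phase occurring in molecular and
condensed matter phenomena will be discussed in the present Notes. Many
review papers exist; here we quote Refs. [43, 44, 45, 3, 12].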

At this point it is also worth emphasizing that in computational physics there
are no derivatives. The ground state $|\Psi_0(\boldsymbol{\xi})\rangle$ is generally found by diagonalizing
a matrix on a finite set of $\boldsymbol{\xi}$ points, and the phase (i.e. the gauge) is chosen by
the diagonalization routine; the phase is therefore nonsmooth and possibly even
random. The discrete form in Eq. (3.10) at finite $M$ is the one universally used
in numerical work; it is clearly unaffected by any erratic phase factor.
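
The insensitivity of the discrete formula to erratic phases can be illustrated with a short numerical experiment. The sketch below makes several assumptions that are not in the Notes: the state is a spin-1/2 spinor aligned with a direction $(\theta, \phi)$, the closed path is a parallel at fixed $\theta$, and the "diagonalization routine" is mimicked by multiplying each state by a random phase. For this toy model the continuum Berry phase, with the sign convention of Eq. (3.7), is $-\pi(1-\cos\theta)$.

```python
import cmath
import math
import random


def spinor(theta, phi):
    """Normalized spin-1/2 state aligned with the direction (theta, phi)."""
    return [complex(math.cos(theta / 2)), cmath.exp(1j * phi) * math.sin(theta / 2)]


def overlap(a, b):
    """Scalar product <a|b>."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def discrete_berry_phase(states):
    """-Im log of the cyclic product of overlaps between contiguous states."""
    product = 1.0 + 0.0j
    count = len(states)
    for s in range(count):
        product *= overlap(states[s], states[(s + 1) % count])
    return -cmath.phase(product)


theta = math.pi / 3
n_points = 200
loop = [spinor(theta, 2.0 * math.pi * s / n_points) for s in range(n_points)]

# Mimic an erratic gauge: one random phase factor per state vector
rng = random.Random(0)
scrambled = []
for psi in loop:
    phase = cmath.exp(1j * rng.uniform(0.0, 2.0 * math.pi))
    scrambled.append([phase * c for c in psi])

gamma_smooth = discrete_berry_phase(loop)
gamma_scrambled = discrete_berry_phase(scrambled)
gamma_continuum = -math.pi * (1.0 - math.cos(theta))

print(gamma_smooth, gamma_scrambled, gamma_continuum)
```

The two discrete values coincide to rounding accuracy, while both differ from the continuum limit only by the discretization error, which decreases when `n_points` is increased.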

3.3 Connection and curvature 

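The loop integral of the Berry connection (i.e. the Berry phase $\gamma$) is nontrivial
in two cases: either the curl of $\mathcal{A}(\boldsymbol{\xi})$ is nonzero, or the curl is zero but the curve
$C$ is not in a simply connected domain. In the former case, we can invoke Stokes'
theorem; the formulation is very simple when $\boldsymbol{\xi}$ is a 3d parameter. If $C$ is the
boundary of a surface $\Sigma$ (i.e. $C = \partial\Sigma$), and the curl of $\mathcal{A}(\boldsymbol{\xi})$ is regular on $\Sigma$,
then Stokes' theorem reads

$$ \gamma = \oint_{\partial\Sigma}\mathcal{A}(\boldsymbol{\xi})\cdot d\boldsymbol{\xi} = \int_{\Sigma}\boldsymbol{\Omega}(\boldsymbol{\xi})\cdot\mathbf{n}\,d\sigma, \tag{3.13} $$

where $\boldsymbol{\Omega}$ is the Berry curvature, defined as

$$ \boldsymbol{\Omega}(\boldsymbol{\xi}) = \nabla_{\boldsymbol{\xi}}\times\mathcal{A}(\boldsymbol{\xi}) = -\mathrm{Im}\,\langle\nabla_{\boldsymbol{\xi}}\Psi_0(\boldsymbol{\xi})|\times|\nabla_{\boldsymbol{\xi}}\Psi_0(\boldsymbol{\xi})\rangle = i\,\langle\nabla_{\boldsymbol{\xi}}\Psi_0(\boldsymbol{\xi})|\times|\nabla_{\boldsymbol{\xi}}\Psi_0(\boldsymbol{\xi})\rangle, \tag{3.14} $$

with the usual meaning of the cross product between three-component bra and
ket states. Equation (3.13) may be spelled out by saying that the curvature is
the Berry phase per unit area of $\Sigma$.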

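For $d \neq 3$ the Berry curvature is conveniently written as the $d \times d$
antisymmetric matrix

$$ \Omega_{\alpha\beta}(\boldsymbol{\xi}) = -2\,\mathrm{Im}\,\langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle; \tag{3.15} $$

Greek subscripts are Cartesian coordinates throughout, and $\partial_\alpha = \partial/\partial\xi_\alpha$. The
Stokes theorem can still be applied, generalizing Eq. (3.13) to

$$ \gamma = \frac{1}{2}\int_{\Sigma} d\xi_\alpha\wedge d\xi_\beta\;\Omega_{\alpha\beta}(\boldsymbol{\xi}). \tag{3.16} $$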

The Berry connection is also known as "gauge potential", and the Berry
curvature as "gauge field" [45]. It is worth pointing out that the former is
gauge-dependent, while the latter is gauge-invariant and therefore corresponds
in general to a measurable quantity, even before any integration. The two
quantities play (in $\boldsymbol{\xi}$-space) a similar role as the vector potential and the magnetic
field in elementary magnetostatics: $\mathbf{A}(\mathbf{r})$ is gauge-dependent, nonmeasurable;
$\mathbf{B}(\mathbf{r}) = \nabla_{\mathbf{r}}\times\mathbf{A}(\mathbf{r})$ is gauge-invariant, measurable.

The Berry phase $\gamma$, defined as the integral over a closed curve $C$ of the
connection, is gauge invariant only modulo $2\pi$. This indeterminacy is resolved
by Eqs. (3.13) and (3.16) whenever the curve $C$ is the boundary $\partial\Sigma$ of a surface
$\Sigma$ where the curvature is regular. In fact, the curvature is gauge-invariant and
has no modulo $2\pi$ indeterminacy.

3.4 Chern number 

The rhs of Eqs. (3.13) and (3.16) is the flux of the Berry curvature on the surface
$\Sigma$; such flux remains meaningful even on a closed surface (e.g. a sphere or a
torus), in which case $\partial\Sigma$ is the empty set. The key result is that such an integral
is quantized. Here we limit ourselves to 3d, where we identify $\Sigma$ with the sphere
$S^2$ (Fig. 3.4):

$$ \frac{1}{2\pi}\int_{S^2}\boldsymbol{\Omega}(\boldsymbol{\xi})\cdot\mathbf{n}\,d\sigma = C_1; \tag{3.17} $$

$C_1$ is an integer ($C_1 \in \mathbb{Z}$), called Chern number of the first class.

The proof is based on a similar algebra as for Dirac's theory of the magnetic
monopole [44, 46]. The theorem goes sometimes under the name of Gauss-
Bonnet-Chern theorem; the analogy with Eq. (1.4) is perspicuous. A specific
example is dealt with in detail in Sec. 4.1.

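The curvature is regular (and divergence-free) on the closed surface $S^2$; the
lhs of Eq. (3.17) is the flux of $\boldsymbol{\Omega}(\boldsymbol{\xi})$ across $S^2$. The integrand $\boldsymbol{\Omega}(\boldsymbol{\xi})$ is the curl of
the connection $\mathcal{A}(\boldsymbol{\xi})$; the latter in general cannot be defined as a single-valued
function globally on $S^2$, but only on patches of it [44, 46]. To fix the ideas,
suppose that $\boldsymbol{\Omega}(\boldsymbol{\xi})$ is singular at $\boldsymbol{\xi} = 0$, and that $S^2$ is the spherical surface
centered at the origin (Fig. 3.4). We cut this surface at the equator $\xi_z = 0$ and
we consider the flux across the two open surfaces:

$$ \int_{S^2}\boldsymbol{\Omega}(\boldsymbol{\xi})\cdot\mathbf{n}\,d\sigma = \int_{S_+}\boldsymbol{\Omega}(\boldsymbol{\xi})\cdot\mathbf{n}\,d\sigma + \int_{S_-}\boldsymbol{\Omega}(\boldsymbol{\xi})\cdot\mathbf{n}\,d\sigma. \tag{3.18} $$

We notice that $\partial S_+ = \partial S_- = C$, but the surface normals $\mathbf{n}$ have opposite
orientations. From Stokes theorem, Eq. (3.13), we get:

$$ \int_{S_\pm}\boldsymbol{\Omega}(\boldsymbol{\xi})\cdot\mathbf{n}\,d\sigma = \pm\oint_C \mathcal{A}_\pm(\boldsymbol{\xi})\cdot d\boldsymbol{\xi} \tag{3.19} $$

$$ \int_{S^2}\boldsymbol{\Omega}(\boldsymbol{\xi})\cdot\mathbf{n}\,d\sigma = \oint_C \mathcal{A}_+(\boldsymbol{\xi})\cdot d\boldsymbol{\xi} - \oint_C \mathcal{A}_-(\boldsymbol{\xi})\cdot d\boldsymbol{\xi}. \tag{3.20} $$

The upper and lower Berry connections $\mathcal{A}_\pm(\boldsymbol{\xi})$ may only differ by a gauge
transformation; the rhs of Eq. (3.20) is the difference of two Berry phases on
the same path and is necessarily a multiple of $2\pi$. This concludes the proof of
Eq. (3.17).

Figure 3.4: A sphere cut at the equator in two hemispheres.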
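
The quantization expressed by Eq. (3.17) can also be observed numerically, by tiling the sphere with small plaquettes and summing the discrete Berry phases accumulated around each of them. The sketch below is only an illustration under stated assumptions: the state is a spin-1/2 spinor aligned with the direction $(\theta, \phi)$ (this toy model and all function names are choices made here, not taken from the Notes), and each plaquette is traversed counterclockwise as seen from outside the sphere.

```python
import cmath
import math


def spinor(theta, phi):
    """Normalized spin-1/2 state aligned with the direction (theta, phi)."""
    return [complex(math.cos(theta / 2)), cmath.exp(1j * phi) * math.sin(theta / 2)]


def overlap(a, b):
    """Scalar product <a|b>."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def plaquette_phase(corners):
    """Discrete Berry phase around a closed four-point path, as in Eq. (3.7)."""
    product = 1.0 + 0.0j
    for s in range(4):
        product *= overlap(corners[s], corners[(s + 1) % 4])
    return -cmath.phase(product)


def chern_number(n_theta=20, n_phi=20):
    """Sum of the plaquette Berry phases over the whole sphere, divided by 2 pi."""
    total = 0.0
    for i in range(n_theta):
        t0 = math.pi * i / n_theta
        t1 = math.pi * (i + 1) / n_theta
        for j in range(n_phi):
            p0 = 2.0 * math.pi * j / n_phi
            p1 = 2.0 * math.pi * (j + 1) / n_phi
            # theta first, then phi: counterclockwise around the outward normal
            corners = [spinor(t0, p0), spinor(t1, p0), spinor(t1, p1), spinor(t0, p1)]
            total += plaquette_phase(corners)
    return total / (2.0 * math.pi)


c1 = chern_number()
print(c1)
```

With these conventions the sum is an integer to rounding accuracy even on a coarse grid; its sign depends on the orientation chosen for the plaquettes and on which of the two spin states is followed.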

We emphasize that the Chern number is a robust topological invariant of 
the wavefunction, and is at the origin of observable effects. The Chern number 
made its first appearance in electronic structure in 1982, in the famous TKNN 
paper about the quantum Hall effect [47] (Sec. 4.2.3). Nowadays more complex 
topological invariants are in fashion, and they characterize a completely novel 
class of insulators, called "topological insulators" [10, 14, 48, 49, 50, 51, 52, 53]. 



3.5 Metric 

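Starting from Eq. (3.5), the infinitesimal distance is

$$ D^2_{\boldsymbol{\xi},\boldsymbol{\xi}+d\boldsymbol{\xi}} = \sum_{\alpha,\beta=1}^{d} g_{\alpha\beta}(\boldsymbol{\xi})\,d\xi_\alpha\,d\xi_\beta, \tag{3.21} $$

where the metric tensor is easily shown to be

$$ g_{\alpha\beta}(\boldsymbol{\xi}) = \mathrm{Re}\,\langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle - \langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|\Psi_0(\boldsymbol{\xi})\rangle\langle\Psi_0(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle = \mathrm{Re}\,\langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|Q(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle; \tag{3.22} $$

the projector $Q(\boldsymbol{\xi})$ is the same as defined in Eq. (3.2). This quantum metric
tensor was first proposed by Provost and Vallee in 1980 [54].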

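At this point we may compare Eq. (3.22) to Eq. (3.15), noticing that the
insertion of $Q(\boldsymbol{\xi})$ is irrelevant in the latter, i.e.

$$ \Omega_{\alpha\beta}(\boldsymbol{\xi}) = -2\,\mathrm{Im}\,\langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|Q(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle. \tag{3.23} $$

It is therefore clear that $g_{\alpha\beta}$ and $\Omega_{\alpha\beta}$ are, apart from a trivial $-2$ factor, the real
(symmetric) and the imaginary (antisymmetric) parts of the same tensor, which
we are going to call $\mathcal{F}_{\alpha\beta}$ in the following:

$$ \mathcal{F}_{\alpha\beta}(\boldsymbol{\xi}) = \langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|Q(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle. \tag{3.24} $$

The metric-curvature tensor $\mathcal{F}_{\alpha\beta}$ is gauge-invariant. A compact equivalent
expression is

$$ \mathcal{F}_{\alpha\beta}(\boldsymbol{\xi}) = \mathrm{Tr}\,\{\partial_\alpha P(\boldsymbol{\xi})\,Q(\boldsymbol{\xi})\,\partial_\beta P(\boldsymbol{\xi})\}, \tag{3.25} $$

manifestly gauge-invariant and Hermitian.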
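
As a concrete illustration of Eq. (3.24), the metric-curvature tensor can be evaluated by finite differences for a simple two-component state. The sketch below assumes a spin-1/2 spinor parametrized by the angles $(\theta, \phi)$, which play the role of the two components of $\boldsymbol{\xi}$; this model, the smooth gauge used for the derivatives, and the helper names are illustrative assumptions. For this state one expects $g_{\theta\theta} = 1/4$, $g_{\phi\phi} = \sin^2\theta/4$, and $\Omega_{\theta\phi} = -\sin\theta/2$.

```python
import cmath
import math


def spinor(theta, phi):
    """Normalized spin-1/2 state aligned with the direction (theta, phi)."""
    return [complex(math.cos(theta / 2)), cmath.exp(1j * phi) * math.sin(theta / 2)]


def overlap(a, b):
    """Scalar product <a|b>."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def derivative(theta, phi, axis, step=1e-5):
    """Central finite difference of the state along theta (axis 0) or phi (axis 1)."""
    if axis == 0:
        plus, minus = spinor(theta + step, phi), spinor(theta - step, phi)
    else:
        plus, minus = spinor(theta, phi + step), spinor(theta, phi - step)
    return [(p - q) / (2.0 * step) for p, q in zip(plus, minus)]


def metric_curvature(theta, phi):
    """F_ab = <d_a psi|d_b psi> - <d_a psi|psi><psi|d_b psi>, i.e. Eq. (3.24)."""
    psi = spinor(theta, phi)
    d = [derivative(theta, phi, 0), derivative(theta, phi, 1)]
    return [[overlap(d[a], d[b]) - overlap(d[a], psi) * overlap(psi, d[b])
             for b in range(2)] for a in range(2)]


theta, phi = 1.0, 0.4
F = metric_curvature(theta, phi)
g_tt = F[0][0].real              # metric, theta-theta component
g_pp = F[1][1].real              # metric, phi-phi component
omega_tp = -2.0 * F[0][1].imag   # curvature, theta-phi component

print(g_tt, g_pp, omega_tp)
```

The real part reproduces the metric of a sphere of radius 1/2, and the imaginary part gives a curvature proportional to the element of solid angle.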



3.6 Sum over states 

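The $\boldsymbol{\xi}$-derivatives entering many of the previous equations, e.g. Eq. (3.24), can
be expressed starting from perturbation theory:

$$ |\Psi_0(\boldsymbol{\xi}+\Delta\boldsymbol{\xi})\rangle - |\Psi_0(\boldsymbol{\xi})\rangle \simeq \sum_{n\neq 0}|\Psi_n(\boldsymbol{\xi})\rangle\,\frac{\langle\Psi_n(\boldsymbol{\xi})|\,[H(\boldsymbol{\xi}+\Delta\boldsymbol{\xi}) - H(\boldsymbol{\xi})]\,|\Psi_0(\boldsymbol{\xi})\rangle}{E_0(\boldsymbol{\xi}) - E_n(\boldsymbol{\xi})}; \tag{3.26} $$

$$ |\partial_\alpha\Psi_0(\boldsymbol{\xi})\rangle = \sum_{n\neq 0}|\Psi_n(\boldsymbol{\xi})\rangle\,\frac{\langle\Psi_n(\boldsymbol{\xi})|\partial_\alpha H(\boldsymbol{\xi})|\Psi_0(\boldsymbol{\xi})\rangle}{E_0(\boldsymbol{\xi}) - E_n(\boldsymbol{\xi})}. \tag{3.27} $$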

These seemingly obvious and innocent formulae need some caveat. It is clear
that inserting Eq. (3.27) into the Berry connection, Eqs. (3.11) and (3.12), we
get a vanishing result for any $\boldsymbol{\xi}$. This happens because the simple expression
of Eq. (3.26) corresponds to a very specific gauge choice (called the parallel-
transport gauge [3]); multiplying the rhs by a $\boldsymbol{\xi}$-dependent phase factor is
legitimate, and must not modify any physical result, while the Berry connection
is instead affected. Nonetheless, since our $\mathcal{F}_{\alpha\beta}(\boldsymbol{\xi})$ is a gauge-invariant quantity,
we may safely evaluate it in any gauge, including the parallel-transport gauge,
implicit in Eq. (3.27). The result is

$$ \mathcal{F}_{\alpha\beta}(\boldsymbol{\xi}) = \sum_{n\neq 0}\frac{\langle\Psi_0(\boldsymbol{\xi})|\partial_\alpha H(\boldsymbol{\xi})|\Psi_n(\boldsymbol{\xi})\rangle\langle\Psi_n(\boldsymbol{\xi})|\partial_\beta H(\boldsymbol{\xi})|\Psi_0(\boldsymbol{\xi})\rangle}{[E_0(\boldsymbol{\xi}) - E_n(\boldsymbol{\xi})]^2}. \tag{3.28} $$

This expression shows explicitly that both the curvature and the metric are 
ill defined and singular wherever the ground state is degenerate with the first 
excited state. Indeed, this is the main reason why the domain may happen not 
to be simply connected. 
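
The singular behavior at a degeneracy is easily seen on a two-level example. The following sketch evaluates the sum over states of Eq. (3.28) under assumptions made here for illustration only: the Hamiltonian is $H(\boldsymbol{\xi}) = -\boldsymbol{\xi}\cdot\boldsymbol{\sigma}$ with Pauli matrices $\boldsymbol{\sigma}$, and the parameter is taken on the positive $z$ axis, $\boldsymbol{\xi} = (0, 0, r)$, where the eigenstates are simply spin up (energy $-r$) and spin down (energy $+r$).

```python
# Pauli matrices as nested lists
SIGMA = {
    "x": [[0, 1], [1, 0]],
    "y": [[0, -1j], [1j, 0]],
    "z": [[1, 0], [0, -1]],
}


def matrix_element(bra, op, ket):
    """<bra|op|ket> for two-component vectors and a 2x2 matrix."""
    return sum(complex(bra[i]).conjugate() * op[i][j] * ket[j]
               for i in range(2) for j in range(2))


def metric_curvature_sos(r, alpha, beta):
    """Sum over states, Eq. (3.28), for H = -xi.sigma at xi = (0, 0, r), r > 0."""
    ground, excited = [1, 0], [0, 1]
    e_ground, e_excited = -r, r
    # Derivatives of H with respect to the Cartesian components of xi
    dh_alpha = [[-x for x in row] for row in SIGMA[alpha]]
    dh_beta = [[-x for x in row] for row in SIGMA[beta]]
    numerator = (matrix_element(ground, dh_alpha, excited)
                 * matrix_element(excited, dh_beta, ground))
    return numerator / (e_ground - e_excited) ** 2


def curvature_xy(r):
    """Omega_xy = -2 Im F_xy."""
    return -2.0 * metric_curvature_sos(r, "x", "y").imag


for r in (2.0, 1.0, 0.5, 0.25):
    print(r, metric_curvature_sos(r, "x", "x").real, curvature_xy(r))
```

Both the metric and the curvature grow as $1/r^2$ when the degeneracy point $r = 0$ is approached, in agreement with the squared energy denominator.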



3.7 Time-reversal and inversion symmetries 

According to Eq. (3.25) the ground-state projector uniquely determines the
curvature

$$ \Omega_{\alpha\beta}(\boldsymbol{\xi}) = -2\,\mathrm{Im}\,\mathrm{Tr}\,\{\partial_\alpha P(\boldsymbol{\xi})\,Q(\boldsymbol{\xi})\,\partial_\beta P(\boldsymbol{\xi})\}. \tag{3.29} $$

It is therefore expedient to analyze the symmetries of the ground-state projector
$P(\boldsymbol{\xi})$, which coincide with the symmetries of the Hamiltonian; these in turn
depend on the nature of the parameter $\boldsymbol{\xi}$. We only address spinless electrons
(no spin-orbit coupling).

When $\boldsymbol{\xi}$ is even under time reversal (like e.g. a nuclear coordinate), then
time-reversal invariance implies that both $H(\boldsymbol{\xi})$ and $P(\boldsymbol{\xi})$ are real for any $\boldsymbol{\xi}$, and
Eq. (3.29) warrants that the curvature is everywhere vanishing. The Berry phase
$\gamma$ can be nonzero (modulo $2\pi$) only if the curve $C$ loops around a singularity or,
more generally, it does not lie in a simply connected domain; the only allowed
values are $\gamma = 0$ or $\gamma = \pi$.

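When instead $\boldsymbol{\xi}$ is odd under time-reversal (like e.g. a momentum) then
time-reversal symmetry requires $P(-\boldsymbol{\xi}) = P^*(\boldsymbol{\xi})$, therefore

$$ \Omega_{\alpha\beta}(-\boldsymbol{\xi}) = -\Omega_{\alpha\beta}(\boldsymbol{\xi}). \tag{3.30} $$

The Berry phase along an inversion-symmetric path vanishes; the Chern number
vanishes as well.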

Next we switch to inversion symmetry. When evaluating any $\boldsymbol{\xi}$-dependent
matrix element (or trace), inversion of the coordinates at fixed $\boldsymbol{\xi}$ is equivalent to
keeping the coordinates fixed and inverting $\boldsymbol{\xi}$. This statement holds whether $\boldsymbol{\xi}$
is a coordinate or a momentum; both are in fact odd under inversion. Therefore
inversion symmetry implies $P(-\boldsymbol{\xi}) = P(\boldsymbol{\xi})$, and

$$ \Omega_{\alpha\beta}(-\boldsymbol{\xi}) = \Omega_{\alpha\beta}(\boldsymbol{\xi}). \tag{3.31} $$

If both time-reversal and inversion symmetry are present, then the Berry
curvature is everywhere vanishing. The Berry phase can be only zero or $\pi$; the latter
case requires a domain which is not simply connected, as above.

Crucial to the above arguments is the fact that the double derivatives appearing
in Eq. (3.29) are even under either time-reversal or inversion. Summarizing
the symmetry results, for the case where $\boldsymbol{\xi}$ is a momentum: a nonvanishing
Chern number can only occur in absence of time-reversal symmetry, but may
occur even in inversion-symmetric cases.

3.8 NonAbelian case 

Very soon after the appearance of Berry's milestone paper, Wilczek and Zee [55]
introduced a many-state generalization. Suppose we do not focus on a single
state in the Hilbert space (say the ground state), while we are interested instead
in the behavior of $n$ different states altogether, as a function of $\boldsymbol{\xi}$. The original
motivation was the possibility of degeneracy amongst the $n$ lowest eigenstates
$|\psi_j(\boldsymbol{\xi})\rangle$ of a given Hamiltonian at some points of the path, and they formulated
the theory under the hypothesis that these lowest states are never degenerate
with the $(n+1)$-th. Strictly speaking, a Hamiltonian is unnecessary: one only
needs to unambiguously identify an $n$-dimensional ($n$ fixed) manifold of states,
as a function of the parameter $\boldsymbol{\xi}$. A gauge transformation is then an $n \times n$ unitary
matrix; in the Abelian case, this matrix is a diagonal one, with elements of unit
modulus. The gauge invariant quantity which defines the manifold is the
$n$-dimensional projector

$$ P(\boldsymbol{\xi}) = \sum_{j=1}^{n}|\psi_j(\boldsymbol{\xi})\rangle\langle\psi_j(\boldsymbol{\xi})|. \tag{3.32} $$

Without loss of generality, we may address the case which is most interesting in
electronic structure. We assume that the $n$ states $|\psi_j(\boldsymbol{\xi})\rangle$ are spin orbitals, and
that $|\Psi(\boldsymbol{\xi})\rangle$ is the many-body wavefunction built as their Slater determinant:

$$ |\Psi(\boldsymbol{\xi})\rangle = \frac{1}{\sqrt{n!}}\,|\psi_1(\boldsymbol{\xi})\,\psi_2(\boldsymbol{\xi})\cdots\psi_n(\boldsymbol{\xi})|. \tag{3.33} $$

We can therefore apply some of the results of the previous Sections. The many-
body Berry phase is given in Eqs. (3.10) and (3.11), which we prefer to rewrite
here in the form

$$ e^{-i\gamma} = \exp\left(-i\oint\mathcal{A}(\boldsymbol{\xi})\cdot d\boldsymbol{\xi}\right), \tag{3.34} $$

where $\mathcal{A}$ is the connection of the many-body wavefunction.

In the nonAbelian case the connection generalizes to a vector of $n \times n$
Hermitian matrices

$$ \mathscr{A}_{jj'}(\boldsymbol{\xi}) = -\mathrm{Im}\,\langle\psi_j(\boldsymbol{\xi})|\nabla_{\boldsymbol{\xi}}\psi_{j'}(\boldsymbol{\xi})\rangle, \tag{3.35} $$

and the Berry phase factor of Eq. (3.34) generalizes to the unitary matrix

$$ e^{-i\Gamma} = \mathscr{P}\exp\left(-i\oint\mathscr{A}(\boldsymbol{\xi})\cdot d\boldsymbol{\xi}\right). \tag{3.36} $$

Here $\mathscr{P}$ is a path-ordering operator, owing to the noncommuting nature of
the $\mathscr{A}$ matrices at different $\boldsymbol{\xi}$ in the nonAbelian case. The $\mathscr{P}$ operator has a
precise meaning when the series expansion of the exponential is considered. The
discretization of Eq. (3.36) is rather straightforward and will not be discussed
here [19].

While the single-state phase factor $e^{-i\gamma}$ is gauge-invariant, the Wilczek-Zee
unitary matrix $e^{-i\Gamma}$ is only gauge-covariant, i.e. a gauge transformation yields
a matrix unitarily equivalent to $e^{-i\Gamma}$; the gauge can be fixed by choosing the
vectors $|\psi_j(\boldsymbol{\xi})\rangle$ at one point of the path. The eigenvalues $\gamma_j$ of the Hermitian
matrix $\Gamma$, defined modulo $2\pi$, are gauge-invariant and in principle individually
observable. It is easily proved that their sum, i.e. the trace of $\Gamma$, coincides with
the Berry phase $\gamma$ of the many-body wavefunction, Eq. (3.34) [19].

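In order to write the metric-curvature tensor in the nonAbelian case, we
start by writing the many-body Abelian metric-curvature tensor of the Slater-
determinant wavefunction, Eq. (3.24), as

$$ \mathcal{F}_{\alpha\beta}(\boldsymbol{\xi}) = \langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle - \langle\partial_\alpha\Psi_0(\boldsymbol{\xi})|P(\boldsymbol{\xi})|\partial_\beta\Psi_0(\boldsymbol{\xi})\rangle. \tag{3.37} $$

Next, we need to make explicit the Cartesian indices $\alpha, \beta$ at the same time as
the matrix indices $j, j'$. The nonAbelian metric-curvature tensor is the
generalized form of Eq. (3.25), i.e.

$$ \mathcal{F}_{\alpha\beta,jj'}(\boldsymbol{\xi}) = \langle\partial_\alpha\psi_j(\boldsymbol{\xi})|\partial_\beta\psi_{j'}(\boldsymbol{\xi})\rangle - \langle\partial_\alpha\psi_j(\boldsymbol{\xi})|P(\boldsymbol{\xi})|\partial_\beta\psi_{j'}(\boldsymbol{\xi})\rangle, \tag{3.38} $$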

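where now $P(\boldsymbol{\xi})$ is the single-particle projector; even this matrix is gauge-
covariant. If we rewrite the Cartesian components of Eq. (3.35) as
$\mathscr{A}_{\alpha,jj'}(\boldsymbol{\xi}) = -\mathrm{Im}\,\langle\psi_j(\boldsymbol{\xi})|\partial_\alpha\psi_{j'}(\boldsymbol{\xi})\rangle$, the nonAbelian curvature becomes

$$ \Omega_{\alpha\beta,jj'}(\boldsymbol{\xi}) = \partial_\alpha\mathscr{A}_{\beta,jj'}(\boldsymbol{\xi}) - \partial_\beta\mathscr{A}_{\alpha,jj'}(\boldsymbol{\xi}) - i\,[\mathscr{A}_\alpha(\boldsymbol{\xi}),\mathscr{A}_\beta(\boldsymbol{\xi})]_{jj'}. \tag{3.39} $$

With respect to Eq. (3.15) notice the presence of an extra term, which vanishes
in the Abelian case. At fixed $jj'$ Eq. (3.39) is clearly antisymmetric in the $\alpha\beta$
Cartesian indices, while at fixed $\alpha\beta$ it is a Hermitian $n \times n$ matrix.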

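The trace over the $j$ index of the (nonAbelian) metric-curvature tensor equals
the corresponding (Abelian) metric-curvature tensor of the many-body ground
state, Eq. (3.25):

$$ \sum_{j=1}^{n}\mathcal{F}_{\alpha\beta,jj}(\boldsymbol{\xi}) = \sum_{j=1}^{n}\langle\partial_\alpha\psi_j(\boldsymbol{\xi})|\partial_\beta\psi_j(\boldsymbol{\xi})\rangle - \sum_{j,j'=1}^{n}\langle\partial_\alpha\psi_j(\boldsymbol{\xi})|\psi_{j'}(\boldsymbol{\xi})\rangle\langle\psi_{j'}(\boldsymbol{\xi})|\partial_\beta\psi_j(\boldsymbol{\xi})\rangle = \mathcal{F}_{\alpha\beta}(\boldsymbol{\xi}). \tag{3.40} $$

The proof of the last equality of Eq. (3.40) is tedious, but straightforward.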

3.9 Bloch orbitals 

We have remained very general so far. The case where the parameter $\boldsymbol{\xi}$ coincides
with the Bloch vector $\mathbf{k}$ bears a particular relevance in the context of the present
Notes. In the framework of first-principles calculations for crystalline systems,
the Bloch states $|\psi_{j\mathbf{k}}\rangle$ are the KS orbitals.

The domain where the $\mathbf{k}$ parameter varies (the reciprocal cell or, equivalently,
the Brillouin zone, BZ) has the geometry of a torus in 1d, 2d, 3d. The whole
BZ is a closed surface; we denote as $\mathbf{G}$ a reciprocal vector.

The Bloch orbital of the $j$-th band is $|\psi_{j\mathbf{k}}\rangle = e^{i\mathbf{k}\cdot\mathbf{r}}|u_{j\mathbf{k}}\rangle$, where the $|u_{j\mathbf{k}}\rangle$ are
cell-periodical (i.e. periodic in $\mathbf{r}$ over the crystal cell), and are eigenfunctions of
the Hamiltonian $H_{\mathbf{k}} = e^{-i\mathbf{k}\cdot\mathbf{r}}He^{i\mathbf{k}\cdot\mathbf{r}}$. While the $|\psi_{j\mathbf{k}}\rangle$ at different $\mathbf{k}$'s are
mutually orthogonal, the $|u_{j\mathbf{k}}\rangle$ live instead in the same Hilbert space (they are all
cell-periodical).

The physical meaning of all the mathematical quantities introduced next
will be discussed in the following Chapters, and not anticipated in the present
one.

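The Berry connection of the $j$-th orbital is

$$ \mathcal{A}_j(\mathbf{k}) = i\,\langle u_{j\mathbf{k}}|\nabla_{\mathbf{k}}u_{j\mathbf{k}}\rangle. \tag{3.41} $$

The relative phases at different $\mathbf{k}$'s are arbitrary. Whenever possible, it is
customary to enforce the so-called periodic gauge $|\psi_{j\mathbf{k}+\mathbf{G}}\rangle = |\psi_{j\mathbf{k}}\rangle$, which implies

$$ |u_{j\mathbf{k}+\mathbf{G}}\rangle = e^{-i\mathbf{G}\cdot\mathbf{r}}|u_{j\mathbf{k}}\rangle. \tag{3.42} $$

We stress, however, that in topologically nontrivial crystals it is generally
impossible to adopt a periodic gauge.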

The interesting closed paths $C$ on the torus are lines across the reciprocal
cell, from one face to the opposite one. For an insulator with $n$ occupied bands
the Berry phase is, according to the previous section,

$$ \gamma = \sum_{j=1}^{n}\int_C\mathcal{A}_j(\mathbf{k})\cdot d\mathbf{k} = i\sum_{j=1}^{n}\int_C d\mathbf{k}\cdot\langle u_{j\mathbf{k}}|\nabla_{\mathbf{k}}u_{j\mathbf{k}}\rangle. \tag{3.43} $$

This Berry phase depends on the choice of the origin in the crystal cell. For
centrosymmetric crystals, if the origin is at a center of inversion symmetry, the
only allowed values are $\gamma = 0$ and $\gamma = \pi$ (modulo $2\pi$).

In case of band crossings the definition of individual bands is ambiguous,
but the Berry phase in Eq. (3.43) is not affected by such ambiguity. More
generally, the results of the previous Section show that Eq. (3.43) is invariant
by a nonAbelian gauge transformation, i.e. by mixing of the occupied orbitals
between themselves by an arbitrary unitary matrix $U_{jj'}(\mathbf{k})$. The mixed
orbitals are no longer Hamiltonian eigenstates; any gauge where instead the
$|u_{j\mathbf{k}}\rangle$ are eigenstates will be called "Hamiltonian gauge".

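The discretization of Eq. (3.43) is nowadays implemented in most electronic
structure codes [56, 57, 58, 59] in order to compute the macroscopic polarization
of crystalline dielectrics (Sec. 5.3). Such discretization is based on the following
result: if $|\Psi(\mathbf{k})\rangle$ is the Slater determinant of the n occupied
$|u_{j\mathbf{k}}\rangle$, then

\[
\langle \Psi(\mathbf{k}_1) | \Psi(\mathbf{k}_2) \rangle = \det S(\mathbf{k}_1, \mathbf{k}_2), \tag{3.44}
\]

where $S(\mathbf{k}_1, \mathbf{k}_2)$ is the $n \times n$ overlap matrix

\[
S_{jj'}(\mathbf{k}_1, \mathbf{k}_2) = \langle u_{j\mathbf{k}_1} | u_{j'\mathbf{k}_2} \rangle. \tag{3.45}
\]

We discretize with M+1 points on the line, where
$\mathbf{k}_{M+1} = \mathbf{k}_1 + \mathbf{G}$; it is understood that
$|u_{j\mathbf{k}_{M+1}}\rangle = e^{-i\mathbf{G}\cdot\mathbf{r}} |u_{j\mathbf{k}_1}\rangle$.
The discretized formula is then

\[
\gamma = i \int_C d\mathbf{k} \cdot \langle \Psi(\mathbf{k}) | \nabla_{\mathbf{k}} \Psi(\mathbf{k}) \rangle
  \;\rightarrow\; -\mathrm{Im} \log \prod_{s=1}^{M} \langle \Psi(\mathbf{k}_s) | \Psi(\mathbf{k}_{s+1}) \rangle
  = -\mathrm{Im} \log \det \prod_{s=1}^{M} S(\mathbf{k}_s, \mathbf{k}_{s+1}). \tag{3.46}
\]

Notice that this is numerically gauge invariant, i.e. it does not depend on
how the diagonalization routine chooses the phases and/or the ordering of the
eigenvectors.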
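
As an illustration of Eq. (3.46), here is a minimal Python sketch for a
one-dimensional two-site chain. The model (`two_site_chain`), its parameters
`v` and `w`, and all function names are our own assumptions, chosen only for
testing purposes. We also assume a tight-binding convention in which the Bloch
Hamiltonian is periodic in k, so that the factor $e^{-i\mathbf{G}\cdot\mathbf{r}}$
closing the loop reduces to the identity on the eigenvector coefficients.

```python
import numpy as np

def two_site_chain(k, v, w):
    """2x2 Bloch Hamiltonian of a dimerized chain (hypothetical test model)."""
    h = v + w * np.exp(1j * k)
    return np.array([[0.0, np.conj(h)], [h, 0.0]])

def discretized_berry_phase(hk, n_occ, n_points):
    """-Im log det prod_s S(k_s, k_{s+1}) over one reciprocal period, Eq. (3.46)."""
    ks = np.linspace(0.0, 2.0 * np.pi, n_points, endpoint=False)
    occupied = []
    for k in ks:
        # eigh returns eigenvalues in ascending order; the phases of the
        # eigenvectors are whatever the routine happens to choose.
        _, vecs = np.linalg.eigh(hk(k))
        occupied.append(vecs[:, :n_occ])
    # Close the loop: k_{M+1} = k_1 + G. In the convention assumed here the
    # coefficients at k_1 + G coincide with those at k_1.
    occupied.append(occupied[0])
    product = np.eye(n_occ, dtype=complex)
    for s in range(n_points):
        overlap = occupied[s].conj().T @ occupied[s + 1]  # S(k_s, k_{s+1})
        product = product @ overlap
    return -np.angle(np.linalg.det(product))

gamma_a = discretized_berry_phase(lambda k: two_site_chain(k, 0.5, 1.0), 1, 200)
gamma_b = discretized_berry_phase(lambda k: two_site_chain(k, 1.0, 0.5), 1, 200)
print(gamma_a, gamma_b)
```

The two parameter sets are related by moving the origin within the cell, and
the resulting phases differ by $\pi$ modulo $2\pi$, in agreement with the
statement that only 0 and $\pi$ are allowed for a centrosymmetric chain.
Multiplying any eigenvector by a random phase leaves the result unchanged.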

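The metric-curvature tensor of n occupied bands obtains straightforwardly
from Eq. (3.40):

\[
\mathcal{F}_{\alpha\beta}(\mathbf{k})
  = \sum_{j=1}^{n} \langle \partial_\alpha u_{j\mathbf{k}} | \partial_\beta u_{j\mathbf{k}} \rangle
  - \sum_{j,j'=1}^{n} \langle \partial_\alpha u_{j\mathbf{k}} | u_{j'\mathbf{k}} \rangle
    \langle u_{j'\mathbf{k}} | \partial_\beta u_{j\mathbf{k}} \rangle. \tag{3.47}
\]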

This is usually integrated over the BZ; being gauge invariant, it carries in prin- 
ciple physical meaning even before any integration. Indeed, the k-dependent 
(single-band) curvature enters the theory of semiclassical transport in crystalline 
solids [60, 61, 12], Sec. 4.5. 

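Eq. (3.47) is the trace of the nonAbelian metric-curvature whose expression is

\[
\begin{aligned}
\mathcal{F}_{\alpha\beta,jj'}(\mathbf{k})
  &= \langle \partial_\alpha u_{j\mathbf{k}} | \partial_\beta u_{j'\mathbf{k}} \rangle
   - \sum_{j''=1}^{n} \langle \partial_\alpha u_{j\mathbf{k}} | u_{j''\mathbf{k}} \rangle
     \langle u_{j''\mathbf{k}} | \partial_\beta u_{j'\mathbf{k}} \rangle \\
  &= \sum_{s=n+1}^{\infty} \langle \partial_\alpha u_{j\mathbf{k}} | u_{s\mathbf{k}} \rangle
     \langle u_{s\mathbf{k}} | \partial_\beta u_{j'\mathbf{k}} \rangle .
\end{aligned} \tag{3.48}
\]

The nonAbelian metric and curvature are

\[
\begin{aligned}
g_{\alpha\beta,jj'} &= \tfrac{1}{2} \left[ \mathcal{F}_{\alpha\beta,jj'}(\mathbf{k}) + \mathcal{F}_{\beta\alpha,jj'}(\mathbf{k}) \right] \\
\Omega_{\alpha\beta,jj'} &= i \left[ \mathcal{F}_{\alpha\beta,jj'}(\mathbf{k}) - \mathcal{F}_{\beta\alpha,jj'}(\mathbf{k}) \right].
\end{aligned} \tag{3.49}
\]

The many-band curvature obtains from either Eq. (3.47) or Eq. (3.49) as

\[
\Omega_{\alpha\beta} = -2\, \mathrm{Im} \sum_{j=1}^{n} \langle \partial_\alpha u_{j\mathbf{k}} | \partial_\beta u_{j\mathbf{k}} \rangle. \tag{3.50}
\]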
Chapter 4



Manifestations of the Berry phase 
4.1 A toy-model Hamiltonian

Our toy model here is a two-level spinless Hamiltonian, of the form

\[
H(\boldsymbol{\xi}) = \boldsymbol{\xi} \cdot \boldsymbol{\sigma}
  = \xi \left( \sin\vartheta \cos\varphi \, \sigma_x + \sin\vartheta \sin\varphi \, \sigma_y + \cos\vartheta \, \sigma_z \right), \tag{4.1}
\]

where $\sigma_\alpha$ are the three Pauli matrices. The spectrum is nondegenerate for
$\xi \neq 0$, and the lowest eigenvalue is $-\xi$. From symmetry arguments, we can
already guess the curvature to be isotropic.

4.1.1 Connection and curvature 

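The lowest eigenvector is

\[
|\psi(\vartheta, \varphi)\rangle =
\begin{pmatrix} \sin\frac{\vartheta}{2} \, e^{-i\varphi} \\ -\cos\frac{\vartheta}{2} \end{pmatrix}.
\]

This corresponds to a specific gauge choice; the eigenvector can be multiplied by
an overall $(\vartheta, \varphi)$-dependent phase factor. The Berry connection and
curvature are

\[
\begin{aligned}
\mathcal{A}_\vartheta &= i \langle \psi | \partial_\vartheta \psi \rangle = 0 \\
\mathcal{A}_\varphi &= i \langle \psi | \partial_\varphi \psi \rangle = \sin^2 \frac{\vartheta}{2} \\
\Omega &= \partial_\vartheta \mathcal{A}_\varphi - \partial_\varphi \mathcal{A}_\vartheta = \frac{1}{2} \sin\vartheta .
\end{aligned} \tag{4.2}
\]

The curvature is gauge-invariant, while the connection is gauge-dependent.
Within our gauge choice the connection displays a vortex at the south pole
($\vartheta = \pi$); other gauges yield the singularity at a different point, but a
singularity is unavoidable. It is impossible to find a gauge that is smooth on
the whole closed surface and yields a nonsingular connection; the singularity is
often called an "obstruction". The algebra is the same as for Dirac's theory of
the magnetic monopole [44, 46]: the degeneracy at the origin is the monopole, and
the singularity is the "Dirac string".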
Figure 4.1: A closed curve C on the surface of the
sphere, and the solid angle spanned by it.

4.1.2 Chern number 

The domain of the parameters $(\vartheta, \varphi)$ is a rectangle, which indeed has
the topology of a torus: a closed surface. Integrating the Berry curvature
therein we get

\[
\frac{1}{2\pi} \int_{S^2} \Omega \; d\vartheta \, d\varphi = 1, \tag{4.3}
\]

i.e. the Chern number of the lowest eigenstate in this problem. This integer
measures the strength of the singularity (magnetic monopole), which resides in
a site inaccessible to the quantum system. The highest eigenstate has $C_1 = -1$,
since the total Chern number is zero.

This simple example illustrates well the meaning of a topological invariant of 
the quantum mechanical ground state. The Chern number is in fact very robust 
under continuous deformations of the surface and of the Hamiltonian: its value 
is always one insofar as one (and only one) degeneracy point is included in the 
closed surface. 

4.1.3 Berry phase 

Suppose now we evaluate the Berry phase over any closed curve C on the sphere
(Fig. 4.1)

\[
\gamma = \oint_C \left( \mathcal{A}_\vartheta \, d\vartheta + \mathcal{A}_\varphi \, d\varphi \right). \tag{4.4}
\]

Owing to Stokes theorem, the Berry phase for this toy model problem clearly
equals the solid angle spanned by the curve, divided by 2. The inherent $4\pi$
arbitrariness in the solid angle leads to the well known $2\pi$ arbitrariness in the
Berry phase: for instance at the equator $\gamma = \pi$ modulo $2\pi$. If we cut the
sphere in two hemispheres, as in Sec. 3.4 (see Fig. 3.4), the difference of the
boundary Berry phases equals 0 modulo $2\pi$, i.e. $2\pi$ times an integer, as it must
be. But in order to tell which integer (the actual Chern number) the two
connections are useless: one has to integrate the curvature, as in Eq. (4.3).
Despite this feature, in numerical work Chern numbers are typically computed via
Berry phases, as described in the next Section.
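
The relation between Berry phase and solid angle can be checked numerically.
The sketch below, in Python, discretizes a parallel of the sphere at polar angle
`theta0`, lets the diagonalization routine choose the phases, and evaluates the
discrete Berry phase as a product of overlaps; the function names and the choice
$\xi = 1$ are ours. For a parallel traversed with increasing $\varphi$ the solid
angle of the cap around the north pole is $2\pi(1 - \cos\vartheta_0)$, hence the
expected phase is $\pi(1 - \cos\vartheta_0)$ modulo $2\pi$.

```python
import numpy as np

def lowest_state(theta, phi):
    """Lowest eigenvector of the toy Hamiltonian of Eq. (4.1) with xi = 1."""
    h = np.array([
        [np.cos(theta), np.sin(theta) * np.exp(-1j * phi)],
        [np.sin(theta) * np.exp(1j * phi), -np.cos(theta)],
    ])
    _, vecs = np.linalg.eigh(h)
    return vecs[:, 0]

def berry_phase_on_parallel(theta0, n_points=2000):
    """Discrete Berry phase along the parallel theta = theta0."""
    phis = np.linspace(0.0, 2.0 * np.pi, n_points, endpoint=False)
    states = [lowest_state(theta0, phi) for phi in phis]
    states.append(states[0])  # closed path: the last state is the first one
    product = 1.0 + 0.0j
    for a, b in zip(states[:-1], states[1:]):
        product *= np.vdot(a, b)  # vdot conjugates its first argument
    return -np.angle(product)

def wrap(x):
    """Reduce an angle to the interval [-pi, pi)."""
    return (x + np.pi) % (2.0 * np.pi) - np.pi

theta0 = np.pi / 3
print(berry_phase_on_parallel(theta0), np.pi * (1.0 - np.cos(theta0)))
```

Comparisons must be made modulo $2\pi$ (hence the helper `wrap`): at the equator
the routine may return either $+\pi$ or $-\pi$, which are the same phase.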

4.1.4 Numerical considerations 

This exactly soluble example also provides the occasion for illustrating the
standard computational approach to Chern numbers. Suppose we discretize the
$(\vartheta, \varphi)$ domain with a rectangular mesh, and that we diagonalize the
Hamiltonian at the points of the mesh. The gauge at any point is chosen by the
diagonalization routine and is thus erratic; we only enforce the toroidal
topology by requiring that the phases at the opposite edges of the rectangle are
the same.

Then for each small rectangle we compute the discrete Berry phase as in
Eq. (3.7), i.e.

\[
\begin{aligned}
\gamma = -\mathrm{Im} \log \;
  & \langle \psi(\vartheta, \varphi) | \psi(\vartheta + \Delta\vartheta, \varphi) \rangle
    \langle \psi(\vartheta + \Delta\vartheta, \varphi) | \psi(\vartheta + \Delta\vartheta, \varphi + \Delta\varphi) \rangle \\
  & \times \langle \psi(\vartheta + \Delta\vartheta, \varphi + \Delta\varphi) | \psi(\vartheta, \varphi + \Delta\varphi) \rangle
    \langle \psi(\vartheta, \varphi + \Delta\varphi) | \psi(\vartheta, \varphi) \rangle .
\end{aligned} \tag{4.5}
\]

The Berry curvature is the Berry phase per unit $(\vartheta, \varphi)$ area. In this
simple, analytically soluble, case we know the exact value; Eq. (4.2) implies for
Eq. (4.5) $\gamma \simeq \frac{1}{2} \sin\vartheta \, \Delta\vartheta \, \Delta\varphi$
modulo $2\pi$. The Chern number is the integral over the domain divided by $2\pi$,
and is therefore obtained from the sum of all the phases computed as in Eq. (4.5)
and covering the whole domain.

Last but not least: how do we get rid of the modulo $2\pi$ indeterminacy in
Eq. (4.5)? The size of $\Delta\vartheta \, \Delta\varphi$ is very small, and each
contribution $\gamma$ to the sum is also small (proportional to
$\Delta\vartheta \, \Delta\varphi$), although Eq. (4.5) is in principle arbitrary
modulo $2\pi$. It should be now pretty clear that the right solution is in
choosing the Im log branch with values in $[-\pi, \pi]$.
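
The procedure just described can be sketched in a few lines of Python. This is
only an illustration under our own assumptions: the function names, the mesh
sizes, and the choice $\xi = 1$ are not prescribed by the text. The eigenvector
phases are left to the diagonalization routine; the periodicity in $\varphi$ is
enforced by wrapping the mesh index, and `np.angle` provides the Im log branch
with values in $(-\pi, \pi]$.

```python
import numpy as np

def toy_hamiltonian(theta, phi, xi=1.0):
    """Two-level Hamiltonian xi * (n . sigma), as in Eq. (4.1)."""
    return xi * np.array([
        [np.cos(theta), np.sin(theta) * np.exp(-1j * phi)],
        [np.sin(theta) * np.exp(1j * phi), -np.cos(theta)],
    ])

def chern_number(band=0, n_theta=40, n_phi=40):
    """Sum of the plaquette Berry phases of Eq. (4.5), divided by 2*pi."""
    thetas = np.linspace(0.0, np.pi, n_theta + 1)
    phis = np.linspace(0.0, 2.0 * np.pi, n_phi, endpoint=False)
    psi = np.empty((n_theta + 1, n_phi, 2), dtype=complex)
    for a, theta in enumerate(thetas):
        for b, phi in enumerate(phis):
            # band = 0 is the lowest eigenvalue; the gauge is erratic.
            _, vecs = np.linalg.eigh(toy_hamiltonian(theta, phi))
            psi[a, b] = vecs[:, band]
    total = 0.0
    for a in range(n_theta):
        for b in range(n_phi):
            b1 = (b + 1) % n_phi  # phi is periodic
            p00, p10 = psi[a, b], psi[a + 1, b]
            p11, p01 = psi[a + 1, b1], psi[a, b1]
            loop = (np.vdot(p00, p10) * np.vdot(p10, p11)
                    * np.vdot(p11, p01) * np.vdot(p01, p00))
            total += -np.angle(loop)  # principal branch of Im log
    return total / (2.0 * np.pi)

print(chern_number(band=0), chern_number(band=1))
```

Each plaquette phase is separately gauge invariant, so no phase alignment
between neighboring mesh points is needed; the two bands carry opposite Chern
numbers, consistent with a vanishing total.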

4.2 Early discoveries reinterpreted 
4.2.1 Aharonov-Bohm effect 

Here we reformulate the Aharonov-Bohm effect as a special case of a Berry
phase. Suppose we have an electron in a box (infinite potential well) centered
at the origin. We take the ground wavefunction as real, and we write it as
$\chi(\mathbf{r})$. The time-independent Schrödinger equation is:

\[
\left[ \frac{p^2}{2m} + V(\mathbf{r}) \right] \chi(\mathbf{r}) = E \, \chi(\mathbf{r}). \tag{4.6}
\]

Displacing the center of the box to position $\mathbf{R}$ changes the Hamiltonian to

\[
H(\mathbf{R}) = \frac{p^2}{2m} + V(\mathbf{r} - \mathbf{R}); \tag{4.7}
\]



Figure 4.2: A particle in a box, transported round a solenoid.



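we will identify the $\xi$ parameter with the box position $\mathbf{R}$. Because of
translational invariance, the R-dependence of the state vectors is

\[
\langle \mathbf{r} | \psi(\mathbf{R}) \rangle = \chi(\mathbf{r} - \mathbf{R}), \tag{4.8}
\]

while the eigenvalue is R-independent.

Suppose now that a magnetic field is switched on somewhere in space. Then
the Hamiltonian becomes

\[
H(\mathbf{R}) = \frac{1}{2m} \left[ \mathbf{p} + \frac{e}{c} \mathbf{A}(\mathbf{r}) \right]^2 + V(\mathbf{r} - \mathbf{R}), \tag{4.9}
\]

where $\mathbf{A}$ is the vector potential and $e$ is the magnitude of the electron
charge. It can be easily verified that a solution of the Schrödinger equation can
be formally written in the form:

\[
\langle \mathbf{r} | \psi(\mathbf{R}) \rangle
  = \exp\left( -\frac{ie}{\hbar c} \int_{\mathbf{R}}^{\mathbf{r}} \mathbf{A}(\mathbf{r}') \cdot d\mathbf{r}' \right) \chi(\mathbf{r} - \mathbf{R}). \tag{4.10}
\]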

But such a solution in general is not a single-valued function of r, since the 
phase factor depends on the path. Therefore we restrict ourselves to a less 
general case, where the magnetic field is generated by a solenoid: the B field 
is nonzero only within a given cylinder, and we don't allow our box to overlap 
this cylinder by suitably restricting the domain of R. This situation is sketched 
in Fig. 4.2. With such a choice the wavefunction, Eq. (4.10), is a single valued 
function of r for any fixed R, and is therefore an honest ground wavefunction. 
As for the dependence on R, Eq. (4.10) only guarantees local single-valuedness,
since the domain is not simply connected: when the system is transported on a 
closed path winding once round the solenoid, the electron wavefunction picks up 
a Berry's phase. This phase difference can be actually detected in interference 
experiments. 

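The Berry connection of the problem is

\[
\boldsymbol{\mathcal{A}}(\mathbf{R}) = i \langle \psi(\mathbf{R}) | \nabla_{\mathbf{R}} \psi(\mathbf{R}) \rangle
  = -\frac{e}{\hbar c} \mathbf{A}(\mathbf{R})
    + i \int d\mathbf{r} \; \chi(\mathbf{r} - \mathbf{R}) \nabla_{\mathbf{R}} \chi(\mathbf{r} - \mathbf{R}), \tag{4.11}
\]

where the last term vanishes, since $\chi$ is real and normalized. Therefore in the
present case the Berry connection is proportional to the ordinary vector
potential. A gauge transformation in the quantum-mechanical sense also coincides
with an electromagnetic gauge transformation, which changes $\mathbf{A}$ while leaving
$\mathbf{B}$ invariant. In fact, in this example $\mathbf{B}$ is essentially the Berry
curvature. The Berry phase is

\[
\gamma = -\frac{e}{\hbar c} \oint_C \mathbf{A}(\mathbf{R}) \cdot d\mathbf{R}
       = -\frac{2\pi}{\phi_0} \oint_C \mathbf{A}(\mathbf{R}) \cdot d\mathbf{R}, \tag{4.12}
\]

where $\phi_0 = hc/e$ is the flux quantum. Therefore $\gamma$ measures the flux of the
magnetic field across the interior of the solenoid, a space region not accessed
by the quantum system: above, we have called it "inaccessible flux". Only the
fractional part of the flux (in units of $\phi_0$) has physical meaning.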
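
A short numerical sketch, in Python, may help to visualize the statement. We
assume an ideal, infinitely thin solenoid on the z axis, with the vector
potential in one common gauge, and an elliptical path for the box; these
choices, the function names, and the units where $\phi_0 = 1$ are ours. The
sign of the phase depends on the charge and orientation conventions, so the
sketch only deals with its magnitude.

```python
import numpy as np

def solenoid_vector_potential(x, y, flux):
    """Vector potential outside a thin solenoid carrying the given flux."""
    r2 = x**2 + y**2
    return -flux * y / (2.0 * np.pi * r2), flux * x / (2.0 * np.pi * r2)

def circulation(center, radii, flux, n=4000):
    """Line integral of A along a counterclockwise ellipse (trapezoid rule)."""
    t = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    x = center[0] + radii[0] * np.cos(t)
    y = center[1] + radii[1] * np.sin(t)
    dx = -radii[0] * np.sin(t)  # dx/dt
    dy = radii[1] * np.cos(t)   # dy/dt
    ax, ay = solenoid_vector_potential(x, y, flux)
    return np.sum(ax * dx + ay * dy) * (2.0 * np.pi / n)

def phase_magnitude(flux, phi0=1.0):
    """2*pi*flux/phi0 reduced to the interval [0, 2*pi)."""
    return (2.0 * np.pi * flux / phi0) % (2.0 * np.pi)

flux = 0.37
print(circulation((0.3, -0.2), (2.0, 1.5), flux))  # path winding round the solenoid
print(circulation((5.0, 0.0), (1.0, 1.0), flux))   # path not enclosing it
print(phase_magnitude(flux), phase_magnitude(flux + 2.0))
```

The circulation equals the enclosed flux whatever the shape of the path, and
vanishes if the path does not wind round the solenoid; adding an integer number
of flux quanta does not change the phase.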

4.2.2 Molecular Aharonov-Bohm effect 

Here we identify the "slow coordinate" $\xi$ with a d-dimensional nuclear
coordinate, and the state vector $|\Psi(\xi)\rangle$ with the electronic ground-state
wavefunction in the Born-Oppenheimer approximation.

We start from the complete Hamiltonian $\mathcal{H}$ of an isolated molecular system,
and we explicitly separate the nuclear kinetic energy:

\[
\mathcal{H}(\xi, [x]) = \frac{1}{2} \sum_{\alpha,\beta=1}^{d} p_\alpha \left( \mathfrak{M}^{-1} \right)_{\alpha\beta} p_\beta + H(\xi, [x]), \tag{4.13}
\]

where $[x]$ indicates the electronic degrees of freedom collectively,
$p_\alpha = -i\hbar \, \partial / \partial \xi_\alpha$ is the canonical momentum conjugated to
$\xi_\alpha$, and the inverse mass matrix $\mathfrak{M}^{-1}$ in general may be a function
of $\xi$, but not of the momenta.

The Born-Oppenheimer approximation starts by writing the eigenfunctions
of Eq. (4.13) in the Schrödinger representation as the product
$\langle [x] | \Psi(\xi) \rangle \, \Phi(\xi)$. Our aim is obtaining an effective Schrödinger
equation for the nuclear wavefunction $\Phi(\xi)$, where the electronic degrees of
freedom have been integrated out. We start considering the effect of the
canonical nuclear momentum $\mathbf{p}$ on the product ansatz:

\[
\mathbf{p} \, |\Psi(\xi)\rangle \Phi(\xi)
  = -i\hbar \, |\Psi(\xi)\rangle \nabla_\xi \Phi(\xi) - i\hbar \, |\nabla_\xi \Psi(\xi)\rangle \Phi(\xi). \tag{4.14}
\]

We then multiply by the electronic eigenbra $\langle \Psi(\xi)|$ on the left, thus
integrating over the electronic degrees of freedom. We get the effective nuclear
momentum $\boldsymbol{\pi}$ acting on $\Phi$ as:

\[
\boldsymbol{\pi} \, \Phi(\xi)
  = \left[ \mathbf{p} - i\hbar \langle \Psi(\xi) | \nabla_\xi \Psi(\xi) \rangle \right] \Phi(\xi)
  = \left[ \mathbf{p} - \hbar \boldsymbol{\mathcal{A}}(\xi) \right] \Phi(\xi), \tag{4.15}
\]

where we easily recognize the Berry connection. The momentum $\boldsymbol{\pi}$ is the
kinematical (also called covariant, or mechanical) momentum, to be distinguished
from $\mathbf{p} = -i\hbar \nabla_\xi$, which is the canonical momentum.

Whenever the time scales of nuclear and electronic motions are well separated
the coupling between different electronic states can be neglected, and the
adiabatic approximation allows one to treat the slow variable $\xi$ in
$H(\xi, [x])$ as a classical parameter. The electronic eigenvalue $E(\xi)$ of a given
state (e.g. the ground state) plays therefore the role of a (scalar) potential
for nuclear motion, whose effective Hamiltonian acting on $\Phi(\xi)$ is then:

\[
\mathcal{H}_{\rm eff} = \frac{1}{2} \sum_{\alpha,\beta=1}^{d} \pi_\alpha \left( \mathfrak{M}^{-1} \right)_{\alpha\beta} \pi_\beta + E(\xi). \tag{4.16}
\]

In the molecular physics literature the extra term in Eq. (4.15) is seldom
mentioned, and $\boldsymbol{\pi}$ is identified with $\mathbf{p}$. The reason is that for
a time-reversal-invariant Hamiltonian, and in absence of spin-orbit interaction,
the wave function can always be taken as real. This corresponds to the parallel
transport gauge, and the Berry connection vanishes at all $\xi$; the tradeoff is
that, in some cases, the electronic wave function is not single valued along a
closed path: see Fig. 2.3. The alternative approach, due to Mead and Truhlar
[30, 62], is to choose a different gauge, where the electronic wave function is
single valued and complex. The Berry phase is gauge invariant; the values allowed
by time-reversal symmetry are 0 and $\pi$; the two cases are experimentally
distinguishable.

We stress that, whenever the ionic motion is purely classical and governed
by Newton's equation, the vector-potential-like term in Eq. (4.15) is irrelevant:
the corresponding curvature (magnetic-field-like) is in fact identically vanishing
along the nuclear trajectory on the Born-Oppenheimer surface. We anticipate
that the case where a genuine magnetic field is present (and the Hamiltonian
is no longer time-reversal-invariant) is qualitatively different in this respect,
see Sec. 4.3 below.



4.2.3 Integer quantum Hall effect 

The famous TKNN (Thouless, Kohmoto, Nightingale, and den Nijs) paper ap- 
peared in 1982 [47] and marks the very first occurrence of a topological invariant 
(the first Chern number) in electronic structure. The outline provided here is 
inspired by Kohmoto [63]. 

We consider a two-dimensional independent-electron system in a
lattice-periodical potential, and subject to a perpendicular B field. The
Hamiltonian is not translationally invariant, but one can address the magnetic
translation group. We choose a large enough "supercell", such that the magnetic
flux is commensurate (i.e. an integer number of flux quanta $\phi_0$ thread the
supercell): in this case a continuous k vector can be defined in the magnetic
Brillouin zone.

As in Sec. 3.9, we define
$|\psi_{j\mathbf{k}}\rangle = e^{i\mathbf{k}\cdot\mathbf{r}} |u_{j\mathbf{k}}\rangle$ and
$H_{\mathbf{k}} = e^{-i\mathbf{k}\cdot\mathbf{r}} H e^{i\mathbf{k}\cdot\mathbf{r}}$; the latter
takes here the form

\[
H_{\mathbf{k}} = \frac{1}{2m} \left[ \mathbf{p} + \hbar\mathbf{k} + \frac{e}{c} \mathbf{A}(\mathbf{r}) \right]^2 + V(\mathbf{r}), \tag{4.17}
\]

where $V$ is the substrate potential. The velocity can be expressed as

\[
\mathbf{v} = \frac{1}{\hbar} \nabla_{\mathbf{k}} H_{\mathbf{k}}, \tag{4.18}
\]

a formula often recurring in the present Notes in various forms, see e.g.
Eqs. (1.18) and (2.17).

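The Kubo formula for transverse conductivity is [64]

\[
\begin{aligned}
\sigma_{xy} &= 2\hbar e^2 \, \mathrm{Im} \, \frac{1}{(2\pi)^2} \int d\mathbf{k}
   \sum_{\epsilon_{j\mathbf{k}} < \mu < \epsilon_{j'\mathbf{k}}}
   \frac{\langle \psi_{j\mathbf{k}} | v_x | \psi_{j'\mathbf{k}} \rangle \langle \psi_{j'\mathbf{k}} | v_y | \psi_{j\mathbf{k}} \rangle}
        {(\epsilon_{j\mathbf{k}} - \epsilon_{j'\mathbf{k}})^2} \\
 &= \frac{2e^2}{\hbar} \, \mathrm{Im} \, \frac{1}{(2\pi)^2} \int d\mathbf{k}
   \sum_{\epsilon_{j\mathbf{k}} < \mu < \epsilon_{j'\mathbf{k}}}
   \frac{\langle u_{j\mathbf{k}} | \partial_x H_{\mathbf{k}} | u_{j'\mathbf{k}} \rangle \langle u_{j'\mathbf{k}} | \partial_y H_{\mathbf{k}} | u_{j\mathbf{k}} \rangle}
        {(\epsilon_{j\mathbf{k}} - \epsilon_{j'\mathbf{k}})^2} ,
\end{aligned} \tag{4.19}
\]

where $\mu$ is the Fermi level. If we now consider the case where the Fermi level
lies in a gap, with n filled bands (Landau levels in a flat potential), Eq. (4.19)
becomes the BZ integral

\[
\sigma_{xy} = \frac{2e^2}{\hbar} \, \mathrm{Im} \, \frac{1}{(2\pi)^2} \int_{\rm BZ} d\mathbf{k}
   \sum_{j=1}^{n} \sum_{j'=n+1}^{\infty}
   \frac{\langle u_{j\mathbf{k}} | \partial_x H_{\mathbf{k}} | u_{j'\mathbf{k}} \rangle \langle u_{j'\mathbf{k}} | \partial_y H_{\mathbf{k}} | u_{j\mathbf{k}} \rangle}
        {(\epsilon_{j\mathbf{k}} - \epsilon_{j'\mathbf{k}})^2} . \tag{4.20}
\]

The integrand is just a simple generalization of the sum-over-states formula of
Eq. (3.28). Using the same arguments as in Ch. 3 it is rather straightforward
to arrive at the identity

\[
\mathrm{Im} \sum_{j=1}^{n} \sum_{j'=n+1}^{\infty}
   \frac{\langle u_{j\mathbf{k}} | \partial_x H_{\mathbf{k}} | u_{j'\mathbf{k}} \rangle \langle u_{j'\mathbf{k}} | \partial_y H_{\mathbf{k}} | u_{j\mathbf{k}} \rangle}
        {(\epsilon_{j\mathbf{k}} - \epsilon_{j'\mathbf{k}})^2}
 = \mathrm{Im} \sum_{j=1}^{n} \langle \partial_x u_{j\mathbf{k}} | \partial_y u_{j\mathbf{k}} \rangle
 = -\frac{1}{2} \Omega_{xy}(\mathbf{k}), \tag{4.21}
\]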
where the many-band Berry curvature, Eq. (3.50), appears. Since the BZ is a
torus, the BZ integral of the curvature equals $2\pi$ times an integer, the (first)
Chern number C. The milestone TKNN discovery is that Hall conductivity is
a Chern number when expressed in klitzing$^{-1}$:

\[
\sigma_{xy} = -\frac{e^2}{h} C. \tag{4.22}
\]

Notice that the sign choices are not uniform across the literature.

Conductivity is a property of the excitations of the system, as is evident
from the Kubo formula above. The Chern number, instead, is a ground state
property. The identity relating them belongs to the general class of
fluctuation-dissipation theorems, although this looks like an oxymoron, the Hall
conductivity being here dissipationless. The interpretation of the Chern number
as a ground-state quantum fluctuation will be elaborated somewhere else in the
present Notes.

The topological nature of the observable explains its extreme robustness
under variations of magnetic field, carrier density, substrate disorder, and more.
The topological invariant C is identified with the filling $\nu$ using the same
arguments as in Sec. 2.4.4; the integer can only be varied by crossing a
conducting state.

4.3 Adiabatic approximation in a magnetic field 

The general problem of the nuclear motion — both classical and quantum — 
in presence of an external magnetic field has been first solved in 1988 by 
Schmelcher, Cederbaum, and Meyer [65]. It is remarkable that such a funda- 
mental problem was solved so late, and that even today the relevant literature 
is ignored by textbooks and little cited. The solution is a manifestation of geo- 
metrical effects in electronic wavefunctions [66] , which appears in a spectacular 
way even when the nuclear motion is addressed at the classical level. 

When a genuine magnetic field, generated by some external source, acts on
the molecular system, the Hamiltonian of Eq. (4.13) is modified by the addition
of a vector potential term in the kinetic energies of both the nuclei and the
electrons. Proceeding as in the zero field case, one writes an ansatz wavefunction
and arrives at the effective Hamiltonian for the ionic motion, Eq. (4.16), where
an extra term must be added to the kinematical momentum $\boldsymbol{\pi}$ of Eq. (4.15).
There are thus two vector potentials in the effective nuclear Hamiltonian: a
geometric one, and a genuinely magnetic one.

However, with respect to the zero-field case, there is a qualitative differ- 
ence whose importance is overwhelming. Since the electronic Hamiltonian is no 
longer invariant under time-reversal, the electronic wavefunction is necessarily 
complex, and the curvature is in general nonzero. No singularity is needed to 
produce geometrical effects on the nuclear motion; the Berry phase will be in 
general nonzero on any path in the space of nuclear coordinates. 

Suppose we are interested in the nuclear motion at the purely classical
level. The Hamiltonian of Eq. (4.16), whose kinematical momentum $\boldsymbol{\pi}$
now includes the two different vector potentials, yields the Hamilton equations
of motion, which can be transformed into the Newton equations of motion: within
the latter, the effects of the vector potentials appear in terms of fields, in the
form of Lorentz forces. The curl of the magnetic vector potential obviously yields
the magnetic field due to the external source; the curl of the geometric vector
potential (Berry curvature) yields an additional "magnetic-like" field which is
nonzero even on the classical trajectory of the nuclei. We stress that this is at
variance with the zero-field case, where the Berry phase had no effect on the
ionic motion at the classical level, and could only be detected when quantizing
the ionic degrees of freedom.

Within a naive Born-Oppenheimer approximation (where Berry phases are
neglected) the magnetic field acts on the nuclei as if they were "naked" charges;
a proper treatment must instead account for electronic screening, which is
provided by the geometric vector potential. Surprisingly, there are very few
calculations of the effect; it is pretty clear, however, that the geometric term
is no small correction.

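For pedagogical purposes we consider the case of a hydrogen atom, hence we
identify the electronic degrees of freedom $[x]$, used in Sec. 4.2.2, with a single
coordinate $\mathbf{r}$, and the parameter $\xi$ with the nuclear coordinate $\mathbf{R}$.
If the atom is subject only to a magnetic field, the complete Hamiltonian
$\mathcal{H}$ and the electronic Hamiltonian $H$ are

\[
\mathcal{H}(\mathbf{R}, \mathbf{r}) = \frac{1}{2M} \left[ \mathbf{p} - \frac{e}{c} \mathbf{A}(\mathbf{R}) \right]^2 + H(\mathbf{R}, \mathbf{r}); \tag{4.23}
\]

\[
H(\mathbf{R}, \mathbf{r}) = \frac{1}{2m} \left[ -i\hbar \nabla_{\mathbf{r}} + \frac{e}{c} \mathbf{A}(\mathbf{r}) \right]^2 - \frac{e^2}{|\mathbf{r} - \mathbf{R}|}. \tag{4.24}
\]

As explained above, the nuclear kinematical momentum of Eq. (4.15) becomes

\[
\boldsymbol{\pi} = \mathbf{p} - \hbar \boldsymbol{\mathcal{A}}(\mathbf{R}) - \frac{e}{c} \mathbf{A}(\mathbf{R}). \tag{4.25}
\]

The case of a constant B field can be dealt with analytically. We choose the
central gauge $\mathbf{A}(\mathbf{r}) = \frac{1}{2} \mathbf{B} \times \mathbf{r}$. If
$\phi(\mathbf{r})$ is the exact ground eigenfunction when the proton sits at
$\mathbf{R} = 0$, the eigenfunction at a generic $\mathbf{R}$ is:

\[
\langle \mathbf{r} | \psi(\mathbf{R}) \rangle = e^{-\frac{ie}{2\hbar c} \mathbf{r} \cdot \mathbf{B} \times \mathbf{R}} \, \phi(\mathbf{r} - \mathbf{R}), \tag{4.26}
\]

with an R-independent eigenvalue. The Berry connection is clearly

\[
\boldsymbol{\mathcal{A}}(\mathbf{R}) = i \langle \psi(\mathbf{R}) | \nabla_{\mathbf{R}} \psi(\mathbf{R}) \rangle
  = -\frac{e}{2\hbar c} \langle \psi(\mathbf{R}) | \mathbf{B} \times \mathbf{r} | \psi(\mathbf{R}) \rangle
  = -\frac{e}{2\hbar c} \mathbf{B} \times \mathbf{R}
  = -\frac{e}{\hbar c} \mathbf{A}(\mathbf{R}), \tag{4.27}
\]

since the R-derivative of $\phi(\mathbf{r} - \mathbf{R})$ does not contribute. Replacing
Eq. (4.27) into Eq. (4.25) we find $\boldsymbol{\pi} = \mathbf{p}$, as it must be: the
nucleus travels at constant speed, and is not deflected by a Lorentz force.

Remarkably, the "magnetic-like" field due to the Berry phase is, in this
simple example, exactly opposite to the external magnetic field, thus providing
the complete screening which is physically expected. In less trivial situations,
the screening affects significantly the molecular vibrations and the classical
nuclear motion in general. The case of H$_2$ has been investigated in 2007 by
Ceresoli et al. [67].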
4.4 Anomalous Hall effect

Edwin H. Hall discovered the eponymous effect, nowadays in all solid state
textbooks, in 1879. Shortly afterwards, in 1881, he discovered that the effect in
ferromagnetic materials is "anomalous". While in nonmagnetic materials the
Hall resistivity is linear in the magnetic field, in ferromagnetic ones it saturates
to a value roughly proportional to the macroscopic magnetization.

In 1954 Karplus and Luttinger [68] provided a theory of the anomalous Hall
effect (AHE) which, with hindsight, was indeed geometrical. The understanding
of AHE, however, remained controversial for another 40 years and more. One of
the reasons for this state of affairs is that the intrinsic geometrical contribution
is partly obscured by extrinsic contributions (jargon: skew scattering and side
jumps), not easily disentangled in the experimental data. A comprehensive
review appeared very recently [11]. Here we only outline the intrinsic theory,
based on geometrical concepts, which is due to two papers that appeared in
2002, by Jungwirth, Niu, and MacDonald [69], and by Onoda and Nagaosa [70].

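The starting point is the Kubo formula of Eq. (4.19), which we rewrite in
dimension d as

\[
\sigma_{xy} = \frac{2e^2}{\hbar} \, \mathrm{Im} \, \frac{1}{(2\pi)^d} \int_{\rm BZ} d\mathbf{k}
   \sum_{\epsilon_{j\mathbf{k}} < \mu < \epsilon_{j'\mathbf{k}}}
   \frac{\langle u_{j\mathbf{k}} | \partial_x H_{\mathbf{k}} | u_{j'\mathbf{k}} \rangle \langle u_{j'\mathbf{k}} | \partial_y H_{\mathbf{k}} | u_{j\mathbf{k}} \rangle}
        {(\epsilon_{j\mathbf{k}} - \epsilon_{j'\mathbf{k}})^2} . \tag{4.28}
\]

While in Sec. 4.2.3 this was integrated in the BZ, for a metal the k integral is
limited to the volume inside the Fermi surface. Nonetheless, the transformation
of the integrand, from a sum over states into a curvature, proceeds in the same
way. The geometrical contribution to the AHE is therefore proportional to the
integral of the Berry curvature within the Fermi surface. To allow for band
crossings, we write the intrinsic AHE conductivity in dimension d as

\[
\sigma_{xy} = \frac{2e^2}{\hbar} \frac{1}{(2\pi)^d} \int_{\rm BZ} d\mathbf{k}
   \sum_{j} \theta(\mu - \epsilon_{j\mathbf{k}}) \,
   \mathrm{Im} \, \langle \partial_x u_{j\mathbf{k}} | \partial_y u_{j\mathbf{k}} \rangle ; \tag{4.29}
\]

the analogy to Eqs. (4.20) and (4.21) is self evident. But this analogy is partly
misleading: at variance with the quantum Hall case, no macroscopic B field is
present here, and the u orbitals in Eq. (4.29) are the (periodic parts of) genuine
Bloch orbitals.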

Given that the Fermi surface is symmetrical under k — ► — k, the symmetry 
considerations of Sec. 3.7 show that Eq. (4.29) can be nonzero only if time- 
reversal symmetry is broken, while inversion symmetry is irrelevant. The typical 
case studies are the ferromagnetic metals, whose ground state breaks indeed 
time-reversal symmetry in absence of a macroscopic B field. 

First-principles calculations were performed for Ni, Cu, and Fe, as well as
for some oxides. The intrinsic geometric contribution appears to be the
dominant one. These calculations also pointed out the crucial role played by
avoided crossings of the bands near the Fermi surface, which induce a very spiky
behavior of the Berry curvature in the BZ. More than $10^6$ k points were used
in Ref. [71] in order to perform the integration in Eq. (4.29); a more efficient
strategy was devised later [72].

A noninteracting (e.g. KS) many-electron system is a trivial example of
a Fermi liquid. Haldane [73] pointed out that the very basic tenet of Landau's
Fermi-liquid theory is that charge transport involves only quasiparticles
with energies within $k_B T$ of the Fermi level. This is apparently at odds with
Eq. (4.29), which is an integration over the whole occupied Fermi sea. The two
viewpoints can be reconciled, essentially via an integration by parts [73]. Even
this alternative form has been implemented in first-principles calculations [74].

4.5 Semiclassical transport 

The semiclassical theory of Bloch electron dynamics plays a fundamental role in 
the physics of metals and semiconductors, and is a typical textbook topic [75]. 
The theory addresses the motion of a wave packet built as a superposition of
Bloch states from the n-th band

\[
|W\rangle = \int_{\rm BZ} d\mathbf{k} \; a(\mathbf{k}, t) \, |\psi_{n\mathbf{k}}\rangle, \tag{4.30}
\]

where the envelope function is well localized in k-space. Because of this, it
is delocalized in r space; we assume, however, that its center of mass is well
defined. Owing to this, we may define the wave vector $\mathbf{k}$ and the center
$\mathbf{r}$ of the wave packet as

\[
\mathbf{k} = \int_{\rm BZ} d\mathbf{k}' \; \mathbf{k}' \, |a(\mathbf{k}', t)|^2 , \qquad
\mathbf{r} = \langle W | \mathbf{r} | W \rangle. \tag{4.31}
\]

4.5.1 Textbook equations of motion 

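In absence of collisions, the equations of motion reported in textbooks and
routinely used in device engineering are

\[
\begin{aligned}
\dot{\mathbf{r}} &= \frac{1}{\hbar} \frac{\partial \epsilon_{\mathbf{k}}}{\partial \mathbf{k}} \\
\hbar \dot{\mathbf{k}} &= -e \left( \mathbf{E} + \frac{1}{c} \dot{\mathbf{r}} \times \mathbf{B} \right),
\end{aligned} \tag{4.32}
\]

where $\epsilon_{\mathbf{k}}$ is the band structure of the relevant band, and
$\mathbf{E}$ and $\mathbf{B}$ are the perturbing fields, assumed weak and slowly varying
in space and time.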

As emphasized in Ref. [75], the derivation of Eq. (4.32), despite the formal 
simplicity of the result, is "a formidable task". The early derivations date from 
the 1930s; the problem was reconsidered several times in the literature, by Slater 
[76], Luttinger [77], and Zak [78] among others. 

4.5.2 Modern equations of motion 

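The ultimate analysis of semiclassical transport is due to a couple of papers by
Q. Niu and coworkers [60, 61] (see also Ref. [12]); a couple of terms, detailed
below, are missing in Eq. (4.32).

The band structure acquires a correction due to the orbital moment
$\mathbf{m}(\mathbf{k})$ of the wave packet:

\[
\epsilon_{\mathbf{k}} \rightarrow \epsilon_{\mathbf{k}} - \mathbf{m}(\mathbf{k}) \cdot \mathbf{B}, \qquad
\mathbf{m}(\mathbf{k}) = -\frac{ie}{2\hbar c} \langle \nabla_{\mathbf{k}} u_{\mathbf{k}} | \times (H_{\mathbf{k}} - \epsilon_{\mathbf{k}}) | \nabla_{\mathbf{k}} u_{\mathbf{k}} \rangle. \tag{4.33}
\]

Furthermore, the canonical momentum $\hbar\mathbf{k}$ has to be replaced with the
kinetic momentum, which includes the (geometrical) vector potential. In the
Newton-like equation of motion, its contribution is reminiscent of a Lorentz force
in reciprocal space. Eq. (4.32) must then be replaced with

\[
\begin{aligned}
\dot{\mathbf{r}} &= \frac{1}{\hbar} \frac{\partial \epsilon_{\mathbf{k}}}{\partial \mathbf{k}} - \dot{\mathbf{k}} \times \boldsymbol{\Omega}(\mathbf{k}) \\
\hbar \dot{\mathbf{k}} &= -e \left( \mathbf{E} + \frac{1}{c} \dot{\mathbf{r}} \times \mathbf{B} \right),
\end{aligned} \tag{4.34}
\]

where $\boldsymbol{\Omega}(\mathbf{k})$ is the Berry curvature of the relevant band, having
the dimensions of a squared length. Notice that the curvature is in general
nonzero even in presence of time-reversal symmetry, provided that the crystal is
noncentrosymmetric.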
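
The effect of the extra term can be sketched with a few lines of Python. We
assume the standard form of the modified equations,
$\dot{\mathbf{r}} = \hbar^{-1} \partial\epsilon_{\mathbf{k}}/\partial\mathbf{k} - \dot{\mathbf{k}} \times \boldsymbol{\Omega}(\mathbf{k})$
and $\hbar\dot{\mathbf{k}} = -e\mathbf{E}$ at $\mathbf{B} = 0$, in units where
$\hbar = e = 1$. The parabolic band, the constant curvature `omega_z`, and the
function name are hypothetical choices made only for illustration; the plain
Euler integrator is the simplest possible one, not a recommended scheme.

```python
import numpy as np

def wave_packet_drift(e_field, omega_z, mass=1.0, t_final=1.0, n_steps=1000):
    """Integrate the semiclassical equations at B = 0 (units hbar = e = 1).

    Hypothetical model: parabolic band eps_k = k^2 / (2 mass) and a constant
    Berry curvature omega_z along z. Returns the final wave packet position.
    """
    dt = t_final / n_steps
    k = np.zeros(3)
    r = np.zeros(3)
    omega = np.array([0.0, 0.0, omega_z])
    field = np.array([e_field[0], e_field[1], 0.0])
    for _ in range(n_steps):
        k_dot = -field                        # hbar k_dot = -e E
        band_velocity = k / mass              # (1/hbar) d eps / d k
        r_dot = band_velocity - np.cross(k_dot, omega)  # anomalous velocity
        r = r + r_dot * dt
        k = k + k_dot * dt
    return r

final_position = wave_packet_drift(e_field=(0.2, 0.0), omega_z=0.5)
print(final_position)
```

With the field along x the band velocity only produces a displacement along x,
while the curvature term produces a drift along y, transverse to the field and
independent of the band dispersion.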



4.5.3 Geometrical correction to the density of states 

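In a remarkable 2005 paper by Xiao, Shi, and Niu [79] it was pointed out
that Eq. (4.34), in presence of a nonzero B field, violates Liouville's theorem.
This means that the volume element
$\Delta V = \Delta\mathbf{r} \, \Delta\mathbf{k}$ changes in time during the evolution of the
system; it is possible, however, to remedy this shortcoming in an elegant way.

The standard volume element of the phase space in dimension d is
$\Delta\mathbf{r} \, \Delta\mathbf{p} / h^d = (2\pi)^{-d} \Delta\mathbf{r} \, \Delta\mathbf{k}$.
According to Ref. [79] this has to be modified by a geometrical term as:

\[
\frac{1}{(2\pi)^d} \Delta\mathbf{r} \, \Delta\mathbf{k} \;\rightarrow\;
\frac{1}{(2\pi)^d} \left( 1 + \frac{2\pi}{\phi_0} \mathbf{B} \cdot \boldsymbol{\Omega}(\mathbf{k}) \right) \Delta\mathbf{r} \, \Delta\mathbf{k}, \tag{4.35}
\]

where $\phi_0$ is the flux quantum.

As a consequence of the modified density of states, the Fermi volume of a
metal changes when a macroscopic B field is switched on at constant electron
density. If instead we keep the chemical potential $\mu$ constant, then the electron
density n depends on B. At zero temperature

\[
\begin{aligned}
n &= \frac{1}{(2\pi)^d} \int_{\rm BZ} d\mathbf{k} \left( 1 + \frac{2\pi}{\phi_0} \mathbf{B} \cdot \boldsymbol{\Omega}(\mathbf{k}) \right) \theta(\mu - \epsilon_{\mathbf{k}}) \\
\left( \frac{\partial n}{\partial \mathbf{B}} \right)_{\mu} &= \frac{1}{(2\pi)^{d-1} \phi_0} \int_{\rm BZ} d\mathbf{k} \; \boldsymbol{\Omega}(\mathbf{k}) \, \theta(\mu - \epsilon_{\mathbf{k}}).
\end{aligned} \tag{4.36}
\]

The latter assumes a perspicuous meaning in 2d, when $\mu$ is in a gap:

\[
\left( \frac{\partial n}{\partial B} \right)_{\mu} = \frac{1}{2\pi \phi_0} \int_{\rm BZ} d\mathbf{k} \; \Omega(\mathbf{k}) = \frac{C}{\phi_0} = -\frac{\sigma_{xy}}{ec}. \tag{4.37}
\]

For a quantum Hall system, this goes under the name of Středa formula [80],
and had been first derived in 1982 in a very different way.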



4.6 Quantum transport 
4.6.1 Transport by a single state 

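We are going to study here the current induced by an adiabatic change of the
potential, or more generally of the Hamiltonian, in the single-particle case. We
indicate as $|\psi_n(t)\rangle$ the adiabatic instantaneous eigenstates, and with
$|\psi(t)\rangle$ the time evolution of the ground state. In order to get rid of the
dynamical phase, it is better to deal with density matrices

\[
\rho(t) = |\psi(t)\rangle \langle \psi(t)| = |\psi_0(t)\rangle \langle \psi_0(t)| + \Delta\rho(t). \tag{4.38}
\]

The velocity of this state is

\[
\begin{aligned}
\mathbf{v}(t) &= \mathrm{Tr} \, \{ \rho(t) \, \mathbf{v} \} \\
 &= \langle \psi_0(t) | \mathbf{v} | \psi_0(t) \rangle
  + \sum_{n \neq 0} \left( \langle \psi_0(t) | \Delta\rho(t) | \psi_n(t) \rangle \langle \psi_n(t) | \mathbf{v} | \psi_0(t) \rangle + \mathrm{c.c.} \right).
\end{aligned} \tag{4.39}
\]

Since the adiabatic density matrix commutes with $H(t)$, the time evolution is

\[
\begin{aligned}
[ H(t), \Delta\rho(t) ] &= i\hbar \, \dot{\rho}(t) \simeq i\hbar \frac{d}{dt} |\psi_0(t)\rangle \langle \psi_0(t)| \\
 &= i\hbar \left( |\dot{\psi}_0(t)\rangle \langle \psi_0(t)| + |\psi_0(t)\rangle \langle \dot{\psi}_0(t)| \right),
\end{aligned} \tag{4.40}
\]

where we are neglecting a term of higher order in the adiabaticity parameter.
We now take the matrix elements between $\langle \psi_0(t)|$ and $|\psi_n(t)\rangle$:

\[
(E_0 - E_n) \langle \psi_0(t) | \Delta\rho(t) | \psi_n(t) \rangle
  = i\hbar \left( \langle \dot{\psi}_0(t) | \psi_n(t) \rangle + \delta_{n0} \langle \psi_0(t) | \dot{\psi}_0(t) \rangle \right). \tag{4.41}
\]

The term with n = 0 in the rhs vanishes because of norm conservation;
replacement into Eq. (4.39) yields

\[
\mathbf{v}(t) = \langle \psi_0(t) | \mathbf{v} | \psi_0(t) \rangle
  + i\hbar \sum_{n \neq 0} \left( \frac{\langle \dot{\psi}_0(t) | \psi_n(t) \rangle \langle \psi_n(t) | \mathbf{v} | \psi_0(t) \rangle}{E_0 - E_n} - \mathrm{c.c.} \right). \tag{4.42}
\]

The first term on the rhs is zero in the special case where, at all times, the
Hamiltonian $H(t)$ is time-reversal invariant and the state is nondegenerate.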



4.6.2 Current carried by filled bands 

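We now exploit the previous result for a system of noninteracting electrons in
the case where the Hamiltonian $H(t)$ is lattice periodical and the ground state
is insulating; this means that the gap remains finite at all t. It will be enough
to consider the simple case of just one filled band, with band index zero; the
current carried by each Bloch orbital
$|\psi_{0\mathbf{k}}\rangle = e^{i\mathbf{k}\cdot\mathbf{r}} |u_{0\mathbf{k}}\rangle$ is

\[
\begin{aligned}
\mathbf{v}_{\mathbf{k}}(t) &= \langle \psi_{0\mathbf{k}} | \mathbf{v} | \psi_{0\mathbf{k}} \rangle
  + i\hbar \sum_{j \neq 0} \left[ \frac{\langle \dot{\psi}_{0\mathbf{k}} | \psi_{j\mathbf{k}} \rangle \langle \psi_{j\mathbf{k}} | \mathbf{v} | \psi_{0\mathbf{k}} \rangle}{\epsilon_{0\mathbf{k}} - \epsilon_{j\mathbf{k}}} - \mathrm{c.c.} \right] \\
 &= \langle u_{0\mathbf{k}} | \mathbf{v} | u_{0\mathbf{k}} \rangle
  + i\hbar \sum_{j \neq 0} \left[ \frac{\langle \dot{u}_{0\mathbf{k}} | u_{j\mathbf{k}} \rangle \langle u_{j\mathbf{k}} | \mathbf{v} | u_{0\mathbf{k}} \rangle}{\epsilon_{0\mathbf{k}} - \epsilon_{j\mathbf{k}}} - \mathrm{c.c.} \right],
\end{aligned} \tag{4.43}
\]

where the t dependence of the rhs is now implicit. We then adopt the usual
formula for the velocity, Eqs. (1.18) and (4.18), and the analog of Eq. (3.27):

\[
\mathbf{v} = \frac{1}{\hbar} \nabla_{\mathbf{k}} H_{\mathbf{k}}, \qquad
|\nabla_{\mathbf{k}} u_{0\mathbf{k}}\rangle = \sum_{j \neq 0} |u_{j\mathbf{k}}\rangle \frac{\langle u_{j\mathbf{k}} | \nabla_{\mathbf{k}} H_{\mathbf{k}} | u_{0\mathbf{k}} \rangle}{\epsilon_{0\mathbf{k}} - \epsilon_{j\mathbf{k}}} . \tag{4.44}
\]

These yield

\[
\mathbf{v}_{\mathbf{k}}(t) = \frac{1}{\hbar} \nabla_{\mathbf{k}} \epsilon_{0\mathbf{k}}
  + i \left( \langle \dot{u}_{0\mathbf{k}} | \nabla_{\mathbf{k}} u_{0\mathbf{k}} \rangle - \langle \nabla_{\mathbf{k}} u_{0\mathbf{k}} | \dot{u}_{0\mathbf{k}} \rangle \right). \tag{4.45}
\]

The first term on the rhs integrates to zero over the BZ, while the second is
clearly a Berry curvature component in the four-dimensional $(\mathbf{k}, t)$ domain.
The current density carried by a filled band in dimension d is

\[
\mathbf{j} = -\frac{ie}{(2\pi)^d} \int_{\rm BZ} d\mathbf{k} \left( \langle \dot{u}_{0\mathbf{k}} | \nabla_{\mathbf{k}} u_{0\mathbf{k}} \rangle - \langle \nabla_{\mathbf{k}} u_{0\mathbf{k}} | \dot{u}_{0\mathbf{k}} \rangle \right); \tag{4.46}
\]

the Bloch states are normalized to one in the crystal cell (as everywhere in the
present Notes).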

4.6.3 Quantization of charge transport 

Let us consider the special case of a simple cubic crystal with lattice constant 
a. The transported charge in time T in the z direction across one cell is 

f T f T ea 2 r /a r /a 

Q = / dtl z (t) = a 2 dtj z (t) = --—z dk x dk y 

JO JO ( 27r ) J-Tr/a J-Ti/a 

i ,T ,7r/a 

X 7T dt dk * ( («0k|9 z U k) - (9 z U k|M0k) )• (4.47) 

27r JO J-7r/a 

If the time evolution of the Hamiltonian is cyclic, H(T) = H(0), then the second
line in Eq. (4.47) is clearly a Chern number C (in the $k_z, t$ variables) and is
integer. Notice also that C is dimensionless, and therefore does not depend on
how fast the Hamiltonian varies with time; ideally, the adiabatic regime means
$T \to \infty$.

We arrive therefore at the outstanding result 

$$
Q = \int_0^T dt\, I_z(t) = e \times \text{integer}
\tag{4.48}
$$

first proved by Thouless in 1983 [81]; it holds of course for any dimension d. 
Let me restate the theorem: if the Hamiltonian is changed adiabatically in such 
a way that it returns to its starting value in time T, the transported charge 
in an infinite periodic system is quantized provided that the system remains 
insulating at all times. A cycle pumps an integer number of elementary charges 
across the system. 
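
The quantization in Eq. (4.48) can be checked numerically. The sketch below is not taken from these Notes: it assumes a two-band Rice-Mele tight-binding chain as a model pump (the hopping and on-site parameters, and all function names, are illustrative choices), and evaluates the Chern number in the (k, t) variables as a sum of gauge-invariant plaquette phases on a discrete periodic grid.

```python
import numpy as np

def lower_band_state(k, t, delta=0.5, gap=0.5):
    """Lower-band eigenvector of an assumed Rice-Mele chain at (k, t).

    The cycle t -> t + 1 encircles the gapless point (equal hoppings,
    zero on-site term), so the model stays insulating at all times.
    """
    v = 1.0 + delta * np.cos(2 * np.pi * t)   # intracell hopping
    w = 1.0 - delta * np.cos(2 * np.pi * t)   # intercell hopping
    d = gap * np.sin(2 * np.pi * t)           # staggered on-site energy
    off = v + w * np.exp(-1j * k)
    h = np.array([[d, off], [np.conj(off), -d]])
    _, vecs = np.linalg.eigh(h)               # eigenvalues in ascending order
    return vecs[:, 0]

def pumped_charge_chern(nk=40, nt=40):
    """Chern number on the (k, t) torus from plaquette Berry phases."""
    ks = 2 * np.pi * np.arange(nk) / nk
    ts = np.arange(nt) / nt
    u = np.array([[lower_band_state(k, t) for t in ts] for k in ks])
    total = 0.0
    for i in range(nk):
        for j in range(nt):
            u00 = u[i, j]
            u10 = u[(i + 1) % nk, j]
            u11 = u[(i + 1) % nk, (j + 1) % nt]
            u01 = u[i, (j + 1) % nt]
            # Product of overlaps around one plaquette: independent of the
            # arbitrary phases returned by the diagonalization routine.
            loop = (np.vdot(u00, u10) * np.vdot(u10, u11)
                    * np.vdot(u11, u01) * np.vdot(u01, u00))
            total += np.angle(loop)
    return total / (2 * np.pi)

if __name__ == "__main__":
    print("Chern number of the pump cycle:", pumped_charge_chern())
```

The result is an integer up to rounding error for any grid fine enough that each plaquette phase stays well below π in magnitude; its sign depends on the orientation convention chosen for the (k, t) plaquettes.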

Among the examples which realize a "Thouless pump", the original paper 
suggests a sliding charge-density wave. A more outstanding manifestation of 
quantized charge transport was pointed out shortly afterwards by Pendry and 
Hodges [82]: Faraday's laws of electrolysis (1832). The mass/charge transfer
ratio shows that charge is always transported in units of e per ion, to the extent
that electrolytic cells are used as standards of current. If a given ion sits at one
electrode at t = 0, and if it drifts to the other one at t = T, the Hamiltonian
can be considered as cyclic, whence charge quantization follows. However, at
intermediate t values the charge "belonging" to a given ion is definitely
nonquantized, and arbitrarily defined: for a review of the possible definitions, see
Ref. [83].

Thouless quantization of charge transport [81], discussed above, also has 
profound relationships to later advances: namely, to the topological explanation 
of the quantization of surface charge (discussed in Sec. 5.3.4), and to the modern 
theory of polarization (discussed in Sec. 5.3). 
Chapter 5

Modern theory of polarization 



The macroscopic polarization P is a fundamental concept that all undergraduates
learn about in elementary courses [84, 85]. In view of this, it is truly
extraordinary that until rather recently there was no generally accepted formula
for P in condensed matter, even as a matter of principle. P is an intensive
vector quantity that intuitively carries the meaning of dipole per unit volume.
Most textbooks [75, 86] provide a flawed definition of P, not implementable in
practical computations [87].

A genuine change of paradigm was initiated by a couple of important pa- 
pers [88, 89], after which the major development was introduced by King-Smith 
and Vanderbilt in 1992 (paper published in 1993 [90]). Other important advances
occurred during the 1990s [91, 92], and the so-called "modern theory of
polarization" has been at a mature stage for more than a decade; several reviews
have appeared in the literature over the years [1, 2, 3, 5, 6, 7, 8].

Aiming at a computational physics readership, it is worth emphasizing that
most ab-initio electronic-structure codes on the market, for dealing with either
crystalline or noncrystalline materials, implement the modern theory of
polarization as a standard option. A nonexhaustive list includes ABINIT [56],
CRYSTAL [57], QUANTUM-ESPRESSO [59], SIESTA [93], VASP [58], and CPMD [94].
Implementations of the modern theory have been instrumental for more than
a decade in the study of ferroelectric and piezoelectric materials [95, 96, 97].

The basic concepts of the modern theory of polarization are also starting to
reach a few textbooks [98], though very slowly; most of them are still plagued
with erroneous concepts and statements.

5.1 Polarization and fields 

The modern theory of polarization, at least in its original form, only addresses 
the polarization P in a null macroscopic E field; in this case P can be nonzero 
only if the medium breaks inversion symmetry. This applies to noncentrosymmetric
crystals, and more generally to cases where inversion symmetry is broken
by some perturbation (as e.g. a frozen phonon, or piezoelectric strain).

It must be realized that, insofar as we address an infinite system with no 
boundaries, the E field is quite arbitrary. The microscopic charge density is 
neutral on average and lattice periodical; the value of E is just an arbitrary
boundary condition for the integration of Poisson's equation. The usual choice
(performed within all electronic-structure codes) is to impose a lattice-periodical
Coulomb potential, i.e. E = 0. Imposing a given nonzero value of E is equally
legitimate (in insulators), although technically more difficult [99, 100].

Figure 5.1: Macroscopic polarization P in a slab normal to z, for a vanishing
external field $\mathbf{E}^{(\rm ext)}$. Left: When P is normal to the slab, a depolarizing field
$\mathbf{E} = -4\pi\mathbf{P}$ is present inside the slab, and charges at its surface, with areal
density $\sigma_{\rm surface} = \mathbf{P}\cdot\mathbf{n}$. Right: When P is parallel to the slab, no depolarizing
field and no surface charge is present.

When addressing a finite sample with boundaries, the E field is in principle
measurable inside the material, without reference to what happens at the
sample boundary; this is not the case for D. In fact, E obtains by averaging
over a macroscopic length scale the microscopic electric field $\mathbf{E}^{(\rm micro)}(\mathbf{r})$, which
fluctuates at the atomic scale [85]. In a macroscopically homogeneous system
the macroscopic field E is constant, and in crystalline materials it coincides with
the cell average of $\mathbf{E}^{(\rm micro)}(\mathbf{r})$. A lattice-periodical potential enforces E = 0; for
a supercell calculation, this applies to the field average over the supercell, while
in different regions there can be a nonzero macroscopic field.

As explained so far, there is no need to address finite samples and external
vs. internal fields from a theoretician's viewpoint. Nonetheless a brief digression
is in order, given that experiments are performed over finite samples, often in
external fields. Suppose a finite macroscopic sample is inserted in a constant
external field $\mathbf{E}^{(\rm ext)}$: the microscopic field $\mathbf{E}^{(\rm micro)}(\mathbf{r})$ coincides with $\mathbf{E}^{(\rm ext)}$ far
away from the sample, while it is different inside because of screening effects.
If we choose a homogeneous sample of ellipsoidal shape, then the macroscopic
average of $\mathbf{E}^{(\rm micro)}(\mathbf{r})$, i.e. the macroscopic screened field E, is constant in
the bulk of the sample. The shape effects are embedded in the depolarization
coefficients [84]: the simplest case is the extremely oblate ellipsoid, i.e. a slab of
a macroscopically homogeneous dielectric; more details are given in Ref. [8]. For
the slab geometry in a vanishing external field $\mathbf{E}^{(\rm ext)}$ the internal field E vanishes
when P is parallel to the slab (transverse polarization), while $\mathbf{E} = -4\pi\mathbf{P}$ is the
depolarization field when P is normal to the slab (longitudinal polarization):
see Fig. 5.1.

5.2 Polarization "itself" vs. polarization difference

Novel ideas about macroscopic polarization emerged in the early 1990s [88, 89]; 
these led to the modern theory, based on a Berry phase, which was founded 
by King-Smith and Vanderbilt soon afterwards [90]. At its foundation, the 
modern theory was limited to a crystalline system in an independent-electron 
framework (either KS or Hartree-Fock). Later, the theory was extended to
correlated and/or disordered systems [91, 92].

Figure 5.2: Top panel: The 14-atom BeO supercell in a vertical plane through
the BeO bonds; the wurtzite (W) and zincblende (ZB) stackings are perspicuous.
Bottom panel: Macroscopic averages of the valence electron density (solid)
and of the electrostatic potential (dotted).

The first calculation ever of spontaneous polarization was published in 
1990 [88]. The case study was BeO: it has the simplest structure where in- 
version symmetry is absent (i.e. wurtzite), and furthermore its constituents 
are first-row atoms. The idea was to address the macroscopic polarization of 
a slab of finite thickness, with faces normal to the c axis, embedding it in an 
ad hoc medium which (i) has no bulk polarization for symmetry reasons, and
(ii) does not produce any geometrical or chemical perturbation at the interface.
The optimal choice is a fictitious BeO in the zincblende structure. For
obvious reasons, the system is periodically replicated in a supercell geometry
(Fig. 5.2, top panel). The selfconsistent calculation shows well localized interface
charges, of opposite sign and equal magnitudes at the two nonequivalent
interfaces (Fig. 5.2, bottom panel). The interface charge is related to the difference
in polarization between the two materials: $\sigma_{\rm interface} = \Delta\mathbf{P}\cdot\mathbf{n}$. The computer
experiment provides the value of $\sigma_{\rm interface}$, and since P vanishes by symmetry
in the zincblende slab, one thus obtains the bulk value of P in the wurtzite
material. Notice that here P is a longitudinal polarization, in a depolarizing
field.

It must be emphasized that the quantity really "measured" in this computer
experiment is $\Delta\mathbf{P}$, not the polarization P itself. After Ref. [88] was published,
a study of the experimental literature showed that — contrary to an incorrect 
widespread belief — no experimental value of P in any wurtzite material ex- 
ists: only estimates are available. Ref. [88] marks, as said above, a change of 
paradigm: polarization must be defined by means of differences, and the con- 
cept of polarization "itself" must be abandoned. With hindsight, it is nowadays 
pretty clear that the problem, as well as its solution, exists already at the clas- 
sical level: this is sketched in Fig. 5.3. Most textbooks are missing this very 
basic fact. 
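
The classical statement can be made concrete with a few lines of arithmetic. The following sketch is only an illustration under stated assumptions: it takes a 1d chain of alternating point charges (cations +q at the lattice points, anions -q displaced by a/2 + u), and the function name and the two cell labels are hypothetical. It shows that the dipole per unit length depends on which cation is assigned to the cell, while the change induced by a sublattice displacement does not.

```python
def polarization(u, cell, a=1.0, q=1.0):
    """Dipole per unit length of one cell of a 1d chain of point charges.

    Cations (+q) sit at x = n*a, anions (-q) at x = n*a + a/2 + u.
    cell == "a": the cell contains the cation to the left of the anion.
    cell == "b": the cell contains the cation to the right of the anion.
    """
    anion = 0.5 * a + u
    cation = 0.0 if cell == "a" else a
    return q * (cation - anion) / a

if __name__ == "__main__":
    u = 0.05
    for cell in ("a", "b"):
        p0 = polarization(0.0, cell)
        p1 = polarization(u, cell)
        print(f"cell ({cell}): P(0) = {p0:+.3f}, P(u) = {p1:+.3f}, "
              f"Delta P = {p1 - p0:+.3f}")
```

The two cell choices give P values that differ by q (the 1d analog of the polarization "quantum"), whereas the difference between the displaced and undisplaced chains is the same for both.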

The modern theory avoids addressing the "absolute" polarization of a given 
equilibrium state, quite in agreement with the experiments, which invariably 
measure polarization differences. Instead, the theory addresses differences in 
polarization between two states of the material that can be connected by an 
adiabatic switching process. The time-dependent Hamiltonian is assumed to 
remain insulating at all times, and the polarization difference is then defined
[89] as the time-integrated transient macroscopic current that flows through the
insulating sample during the switching process:

$$
\Delta\mathbf{P} = \mathbf{P}(\Delta t) - \mathbf{P}(0) = \int_0^{\Delta t} dt\, \mathbf{j}(t).
\tag{5.1}
$$

In the adiabatic limit $\Delta t \to \infty$ and $\mathbf{j}(t) \to 0$, while $\Delta\mathbf{P}$ stays finite.
Addressing currents (instead of charges) explains the occurrence of phases of the
wavefunctions (instead of square moduli) in the modern theory. Eventually the
time integration in Eq. (5.1) will be eliminated, leading to a two-point formula
involving only the initial and final states.

Figure 5.3: A 1d solid with infinite length. Different choices of the unit cell
give different P values: (a), (b). On the other hand, the change of polarization
$\Delta P$ does not depend on the choice of the unit cell (c).



5.3 Independent electrons 

5.3.1 The King-Smith and Vanderbilt formula 

For a crystalline system of independent electrons the expression for the tran- 
sient current occurring in Eq. (5.1) is precisely the same as previously derived 
for quantum transport, Eq. (4.46). Therefore for one band (index 0) and sin- 
gle occupancy in dimension d the electronic contribution to the polarization 
difference is 

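$$
\Delta\mathbf{P} = -\frac{ie}{(2\pi)^d}\int_0^{\Delta t} dt\int_{\rm BZ} d\mathbf{k}\,\left( \langle\dot u_{0\mathbf{k}}|\nabla_{\mathbf{k}} u_{0\mathbf{k}}\rangle - \langle\nabla_{\mathbf{k}} u_{0\mathbf{k}}|\dot u_{0\mathbf{k}}\rangle \right);
\tag{5.2}
$$

the (classical) nuclear contribution must be added separately. We recall that
crystal-cell neutrality is essential. Notice also that, given the occurrence of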
Bloch states, the Hamiltonian is lattice periodical at all t: this implicitly means 
that in Eq. (5.2) AP is evaluated at E = 0. 

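It is now expedient to introduce a dimensionless adiabatic time $\lambda$, with
$|u_{j\mathbf{k}}(t)\rangle = |u_{j\mathbf{k}}(\lambda(t))\rangle$, $\lambda(0) = 0$, $\lambda(\Delta t) = 1$. Eq. (5.2) becomes then, for $n$
doubly occupied bands (index $j = 1, \dots, n$) in 3d:

$$
\begin{aligned}
\Delta\mathbf{P} &= \mathbf{P}(1) - \mathbf{P}(0) = \int_0^1 d\lambda\, \partial_\lambda \mathbf{P}(\lambda), \\
\partial_\lambda \mathbf{P}(\lambda) &= -\frac{2ie}{(2\pi)^3}\sum_{j=1}^{n}\int_{\rm BZ} d\mathbf{k}\,\left( \langle\partial_\lambda u_{j\mathbf{k}}|\nabla_{\mathbf{k}} u_{j\mathbf{k}}\rangle - \langle\nabla_{\mathbf{k}} u_{j\mathbf{k}}|\partial_\lambda u_{j\mathbf{k}}\rangle \right).
\end{aligned}
\tag{5.3}
$$

It is essential that the gap does not close, i.e. that the system remains
insulating, for all $\lambda$ values.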

The expression in Eq. (5.3) can be integrated with respect to $\lambda$ to obtain

$$
\mathbf{P}(\lambda) = -\frac{2ie}{(2\pi)^3}\sum_{j=1}^{n}\int_{\rm BZ} d\mathbf{k}\,\langle u_{j\mathbf{k}}|\nabla_{\mathbf{k}} u_{j\mathbf{k}}\rangle;
\tag{5.4}
$$

this is the (by now famous) King-Smith and Vanderbilt formula [90], yielding
the polarization of the final state minus the polarization of the initial state,
Eq. (5.3). To understand the meaning of the k integral in 3d we take the simple
example of a simple cubic lattice of constant a, similarly to Eq. (4.47):

$$
P_z(\lambda) = -\frac{2e}{(2\pi)^3}\sum_{j=1}^{n}\int_{-\pi/a}^{\pi/a} dk_x\int_{-\pi/a}^{\pi/a} dk_y\left[\, i\int_{-\pi/a}^{\pi/a} dk_z\,\langle u_{j\mathbf{k}}|\partial_{k_z} u_{j\mathbf{k}}\rangle \right],
\tag{5.5}
$$

where the square parenthesis highlights the Berry phase, to be compared to
Eq. (3.43).

In first-principles implementations, the Berry phase is discretized as in
Eqs. (3.45) and (3.46), and the remaining 2d k-integral is discretized in the
trivial way. The first calculation ever of the "spontaneous" polarization of a
ferroelectric material (KNbO3) appeared in 1993 [101], and agreed within 10%
with the measured values. As said above, the Berry-phase formula is nowadays
implemented in most first-principles codes.



5.3.2 The quantum of polarization 

Given that every phase is defined modulo $2\pi$, all of the two-point formulas for
$\Delta\mathbf{P}$ in terms of Berry phases are arbitrary modulo a polarization "quantum".
This is the price one has to pay when switching from the curvature formula,
Eq. (5.3), where no such arbitrariness exists, to the two-point King-Smith and
Vanderbilt formula, where only the connection occurs. The actual arbitrariness
of $\Delta\mathbf{P}$ in 3d is $2e\mathbf{R}/V_{\rm cell}$, where R is a lattice vector and $V_{\rm cell}$ is the cell volume
(the factor of 2 owes to double band occupancy). A similar arbitrariness of an
integer times $e\mathbf{R}/V_{\rm cell}$ occurs for the classical nuclear contribution to
polarization.

The quantum arbitrariness is rarely a problem in practice. In most cases, the 
change in P that can be induced by a perturbation, such as a small sublattice 
displacement, is insufficient to cause P to change by a large fraction of the 
quantum. Where exceptions exist the ambiguity can be resolved by subdividing 
the adiabatic path into several shorter intervals, for each of which the change 
in P is unambiguous for practical purposes. Additional problems may occur in 
the discretized version of the Berry-phase formula: this is discussed e.g. in Ref. 
[8] and, in more detail, in Ref. [7] . 
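
The subdivision of the adiabatic path can be illustrated as follows. This is a minimal sketch, not the procedure of any specific code: it assumes a 1d two-band Rice-Mele chain with single band occupancy (model, parameters and function names are illustrative), evaluates the discretized Berry phase at each point of the path, and removes the 2π jumps between neighboring points with NumPy's `unwrap`. In 1d and in units of e, the polarization is taken as minus the Berry phase divided by 2π.

```python
import numpy as np

def lower_band(k, t, delta=0.5, gap=0.5):
    """Lower-band eigenvector of an assumed Rice-Mele chain at (k, t)."""
    v = 1.0 + delta * np.cos(2 * np.pi * t)
    w = 1.0 - delta * np.cos(2 * np.pi * t)
    d = gap * np.sin(2 * np.pi * t)
    off = v + w * np.exp(-1j * k)
    h = np.array([[d, off], [np.conj(off), -d]])
    _, vecs = np.linalg.eigh(h)
    return vecs[:, 0]

def berry_phase(t, nk=60):
    """Discretized Berry phase: minus the phase of a closed product of overlaps."""
    ks = 2 * np.pi * np.arange(nk) / nk
    states = [lower_band(k, t) for k in ks]
    states.append(states[0])          # close the loop with the same state
    prod = 1.0 + 0.0j
    for left, right in zip(states[:-1], states[1:]):
        prod *= np.vdot(left, right)
    return -np.angle(prod)            # defined modulo 2*pi

def polarization_change(n_steps=50):
    """Delta P (units of e) along t in [0, 1], tracked in small steps."""
    ts = np.linspace(0.0, 1.0, n_steps + 1)
    phases = np.unwrap([berry_phase(t) for t in ts])
    return -(phases[-1] - phases[0]) / (2 * np.pi)

if __name__ == "__main__":
    print("Delta P over one cycle, in units of e:", polarization_change())
```

For this closed cycle the two end points are the same physical state, so a bare two-point evaluation would give zero modulo the quantum; following the phase in small steps fixes the branch and returns one quantum, in line with the quantized transport of Sec. 4.6.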

Here we stress that the quantum ambiguity is an essential aspect of the
theory. For example, for the case of a closed cyclic adiabatic evolution of the
system, in which the parameter values $\lambda = 0$ and $\lambda = 1$ label the same physical
state of the system, we retrieve the quantization of charge transport, discussed
in Sec. 4.6, and governed by a Chern number.
5.3.3 Wannier functions



The KS (or Hartree-Fock) ground state is a Slater determinant of doubly occupied
orbitals; any unitary transformation of the occupied states among themselves
leaves the determinantal wavefunction invariant (apart from an irrelevant phase
factor), and hence it leaves invariant any KS ground-state property.

For an insulating crystal the Bloch KS orbitals of completely occupied bands
can be transformed to localized Wannier orbitals or functions (WFs). This has
been known since 1937 [102], but for many years the WFs were mostly used as a
formal tool; they became a popular topic in computational electronic structure
only after the seminal work of Marzari and Vanderbilt [103]. A comprehen- 
sive review appeared as Ref. [18], and a public-domain implementation is in 
wannier90 [104]. If the crystal is metallic, the WFs can still be technically 
useful [105], but it must be emphasized that the ground state cannot be writ- 
ten as a Slater determinant of localized orbitals of any kind, as a matter of 
principle [106]. 

The transformation of the Berry phase formula in terms of WFs provides an 
alternative, and perhaps more intuitive, viewpoint. The formal transformation 
was known since the 1950s [107], although the physical meaning of the formalism 
was not understood until the advent of the modern theory of polarization. 

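The unitary transformation which defines the WF $w_{j\mathbf{R}}(\mathbf{r})$, labeled by band
$j$ and unit cell $\mathbf{R}$, within our normalization is

$$
|w_{j\mathbf{R}}\rangle = \frac{V_{\rm cell}}{(2\pi)^3}\int_{\rm BZ} d\mathbf{k}\; e^{i\mathbf{k}\cdot\mathbf{R}}\,|\psi_{j\mathbf{k}}\rangle.
\tag{5.6}
$$
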
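If one then defines the "Wannier centers" as $\mathbf{r}_{j\mathbf{R}} = \langle w_{j\mathbf{R}}|\mathbf{r}|w_{j\mathbf{R}}\rangle$, it is rather
straightforward to prove that Eq. (5.4) is equivalent to

$$
\mathbf{P}^{(\rm el)} = -\frac{2e}{V_{\rm cell}}\sum_{j=1}^{n}\mathbf{r}_{j\mathbf{0}}.
\tag{5.7}
$$
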
This means that the electronic term in the macroscopic polarization P is (twice) 
the dipole of the Wannier charge distributions in the central cell, divided by the 
cell volume. The nuclear term is obviously similar in form to Eq. (5.7); the sum 
of both terms is charge neutral. 

WFs are severely gauge-dependent, since the phases of the $|\psi_{j\mathbf{k}}\rangle$ appearing in
Eq. (5.6) can be chosen arbitrarily. However, their centers are gauge-invariant
modulo a lattice vector. Therefore $\mathbf{P}^{(\rm el)}$ in Eq. (5.7) is affected by the same
"quantum" indeterminacy discussed above. We also stress, once more, that
Eq. (5.7), as well as Eq. (5.3), is to be used in polarization differences, and
does not define polarization itself.

The modern theory, when formulated in terms of WFs, becomes much 
more intuitive, and in a sense vindicates the venerable Clausius-Mossotti view- 
point [108]: in fact, the charge distribution is partitioned into localized contri- 
butions, each providing an electric dipole, and these dipoles yield the electronic 
term in P. However, it is clear from Eq. (5.6) that the phase of the Bloch or- 
bitals is essential to arrive at the right partitioning. Any decomposition based 
on charge only is severely nonunique and does not provide in general the right 
P, with the notable exception of the extreme case of molecular crystals. 




In the latter case, in fact, we may consider the set of WFs centered on a
given molecule; their total charge distribution coincides — in the weakly inter- 
acting limit — with the electron density of the isolated molecule (possibly in a 
local field). This justifies the elementary Clausius-Mossotti viewpoint. It is 
worth mentioning that the dipole of a polar molecule is routinely computed in a 
supercell geometry via the single-point Berry phase discussed below [109]. The 
dipole value coincides with the one computed in the trivial way in the large 
supercell limit. Finite-size corrections, due to the local field (different in the 
two cases), can also be applied [110]. 

The case of alkali halides, where the model is often phenomenologically
used, deserves a different comment [8]. The electron densities of isolated
ions (with or without fields) are quite different from the corresponding WFs 
charge distributions, for instance because of orthogonality constraints: hence 
the Clausius-Mossotti model is not justified in its elementary form, despite con- 
trary statements in the literature. For a detailed analysis, see Ref. [111]. 

5.3.4 The surface charge theorem 

The early occurrences of the theorem of quantization of the surface charge [32, 
33, 34, 35] are discussed above, Sec. 2.3. The topological nature of this theorem 
was first realized by Niu [112] in 1986; here we follow the treatment of Vanderbilt 
and King-Smith [113] (see also Ref. [5]). 

According to elementary electrostatics the macroscopic bound surface charge
density $\sigma_{\rm surface}$ residing on the surface of a sample is related to the polarization
in the interior by $\sigma_{\rm surface} = \mathbf{n}\cdot\mathbf{P}$, where n is the surface normal. One defines
the bound charge $\sigma_{\rm surface}$ by saying that no free charge is present, but what,
precisely, does this mean? The surface must be insulating, with the electron
chemical potential lying in a gap that is common to both bulk and surface.
But this is not a unique prescription, since there can be a surface band which
is entirely occupied or entirely empty. The two cases differ by a polarization
quantum in the corresponding P value. In fact, given that the bulk polarization
P is arbitrary modulo $e\mathbf{R}/V_{\rm cell}$, it follows that the charge per surface area is
defined modulo a quantum

$$
\sigma_{\rm surface} = \mathbf{n}\cdot\mathbf{P} \quad \text{modulo} \quad \frac{e}{A_{\rm surface}}.
\tag{5.8}
$$

An equivalent formulation of the surface charge theorem can be arrived at by
means of WFs. The WF approach is most perspicuous for quasi-1d systems (e.g.
insulating polymers); a pedagogical presentation is in Ref. [114]. Notice that
there cannot be surface states in a polymer: all terminations are "insulating".

The bulk-surface correspondence encoded in Eq. (5.8) is an outstanding
manifestation of topology in condensed matter physics: the surface charge of an
insulating surface is "topologically protected". The actual value of $\sigma_{\rm surface}$ among
the discrete allowed values is then determined by energy considerations.

For a centrosymmetric crystal it is tempting to guess that $\mathbf{P}(\lambda)$ in the
Berry-phase formula, Eq. (5.4), vanishes for any $\lambda$. Instead, this is not the case:
centrosymmetry only dictates that $\mathbf{P} = -\mathbf{P}$ modulo a quantum, and therefore
Eq. (5.4) yields P equal to an integer or half-integer multiple of the quantum
$e\mathbf{R}/V_{\rm cell}$ (for single band occupancy). Then from Eq. (5.8) it follows that the
charge per surface cell may only be an integer or half integer, as first
discovered many years ago, and previously discussed in Sec. 2.3. Therein, it was
observed that this important theorem is often ignored even by specialists in
surface physics. A thorough analysis of polar surfaces, in the light of the present
theorem, has recently been published by Stengel [115].
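
The integer/half-integer alternative can be seen in the simplest centrosymmetric model. The sketch below is an illustration only, under assumptions not made in the text: a two-site Su-Schrieffer-Heeger tight-binding chain with hoppings v (intracell) and w (intercell), both sites placed at the cell origin, one singly occupied band, and hypothetical function names. Its discretized Berry phase is either 0 or π (modulo 2π), i.e. P is either zero or half a quantum.

```python
import numpy as np

def ssh_berry_phase(v, w, nk=200):
    """Discretized Berry phase of the lower band of an assumed SSH chain."""
    ks = 2 * np.pi * np.arange(nk) / nk
    states = []
    for k in ks:
        h = v + w * np.exp(-1j * k)
        ham = np.array([[0.0, h], [np.conj(h), 0.0]])
        _, vecs = np.linalg.eigh(ham)
        states.append(vecs[:, 0])     # lower band
    states.append(states[0])          # close the loop across the zone boundary
    prod = 1.0 + 0.0j
    for left, right in zip(states[:-1], states[1:]):
        prod *= np.vdot(left, right)
    return -np.angle(prod)            # defined modulo 2*pi

if __name__ == "__main__":
    for v, w in ((1.0, 0.5), (0.5, 1.0)):
        gamma = ssh_berry_phase(v, w)
        print(f"v = {v}, w = {w}: cos(gamma) = {np.cos(gamma):+.6f}")
```

Printing the cosine rather than the phase itself avoids the ambiguity between +π and -π, which label the same value modulo 2π. Which of the two dimerizations is assigned phase π depends on the choice of cell origin; only the fact that both values are multiples of π is dictated by symmetry.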

5.3.5 Noncrystalline systems: The single-point Berry 
phase 

The key ingredient for computing the infrared spectrum of amorphous or liquid
systems is the power spectrum of the autocorrelation function of the
macroscopic polarization $\langle \mathbf{P}(t)\cdot\mathbf{P}(0)\rangle$. Since Car-Parrinello simulations are
customarily performed using only k = 0 in the (supercell) BZ, we need to analyze the
single-point version of the Berry-phase formula for polarization, Eq. (5.5).

We consider a simple cubic supercell of side $L$; in the large-$L$ limit the BZ
integral of any function $f(\mathbf{k})$ is approximated as

$$
\int_{\rm BZ} d\mathbf{k}\, f(\mathbf{k}) \simeq \left(\frac{2\pi}{L}\right)^3 f(0).
\tag{5.9}
$$

We start with the Berry phase, i.e. with the 1d integral in square parenthesis
in Eq. (5.5), summed over the occupied bands:

$$
\gamma_z = i\sum_{j}\int dk_z\,\langle u_{j\mathbf{k}}|\partial_{k_z} u_{j\mathbf{k}}\rangle \;\to\; -\mathrm{Im}\,\log\det S(\mathbf{k}_1,\mathbf{k}_2),
\tag{5.10}
$$

where $\mathbf{k}_1 = (0,0,0)$ and $\mathbf{k}_2 = (0,0,2\pi/L)$. In Eq. (5.10) we have used the
discretized Berry phase, Eq. (3.46), with only one factor in the matrix product.
Then, as in Sec. 3.9, we notice that $|u_{j\mathbf{k}_2}\rangle = e^{-i\frac{2\pi}{L}z}|u_{j\mathbf{k}_1}\rangle$; therefore the overlap
matrix in Eq. (5.10) becomes

$$
S_{jj'}(\mathbf{k}_1,\mathbf{k}_2) = \langle u_j|e^{-i\frac{2\pi}{L}z}|u_{j'}\rangle,
\tag{5.11}
$$

where all the orbitals $|u_j\rangle = |\psi_j\rangle$ are evaluated at k = 0. We then approximate
even the remaining integrals in Eq. (5.5) with a single point. At any time during
the simulation the electronic term in the polarization is thus

$$
P_z^{(\rm el)}(t) = -\frac{e}{\pi L^2}\,\gamma_z = \frac{e}{\pi L^2}\,\mathrm{Im}\,\log\det S.
\tag{5.12}
$$

The nuclear (or core) contribution has a very simple form. If $z_m$ is the
instantaneous z coordinate of the $m$-th nucleus, and $eZ_m$ the corresponding charge,
the total polarization is

$$
P_z(t) = \frac{e}{\pi L^2}\,\mathrm{Im}\,\log\det S + \frac{e}{L^3}\sum_m Z_m z_m.
\tag{5.13}
$$

This is the expression currently used in computing power spectra and infrared
spectra [116], and, more generally, whenever a single k point is used in
first-principles simulations [117, 109].

The polarization quantum in Eq. (5.13) is $e/L^2$, which vanishes in the $L \to \infty$
limit. This poses no problem, and in fact Eq. (5.12) is routinely used
for evaluating polarization differences in noncrystalline materials. The key point
is that the $L \to \infty$ limit is not actually needed; for an accurate description of
a given material, it is enough to assume a finite $L$, actually larger than the
relevant correlation lengths in the material. For any given length, the quantum
$e/L^2$ sets an upper limit to the magnitude of a polarization difference accessible
via the Berry phase. The larger are the correlation lengths, the smaller is the
accessible $\Delta\mathbf{P}$. This is no problem at all in practice, either when evaluating
static derivatives by numerical differentiation, such as e.g. in Refs. [117, 109],
or when performing Car-Parrinello simulations [116]. In the latter case $\Delta t$ is
a Car-Parrinello time step (a few a.u.), during which the polarization varies
by a tiny amount, much smaller than the quantum (the typical size of a large
simulation cell nowadays is $L \simeq 50$ bohr). Whenever needed, the drawback may
be overcome by splitting $\Delta t$ into several smaller intervals.
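
The way a single-point phase encodes a dipole in a periodic supercell can be sketched in the simplest setting. The example below is an assumption-laden illustration, not the formula of any specific code: it takes a single localized one-electron density on a 1d ring of length L (so the determinant reduces to a single matrix element), with an illustrative periodic Gaussian and hypothetical function names. It compares the center obtained from the phase of the expectation value of exp(i 2π z / L) with the first moment computed "in the trivial way"; the dipole is -e times the center.

```python
import numpy as np

def periodic_gaussian(z, z0, sigma, L):
    """Gaussian density centered at z0, including its nearest periodic images."""
    return sum(np.exp(-(z - z0 - n * L) ** 2 / (2 * sigma ** 2))
               for n in (-1, 0, 1))

def center_from_phase(density, z, L):
    """Center (modulo L) from the phase of <exp(i 2 pi z / L)>."""
    phase = np.angle(np.sum(density * np.exp(2j * np.pi * z / L)))
    return (L * phase / (2 * np.pi)) % L

def center_naive(density, z):
    """First moment of the density inside the cell [0, L)."""
    return np.sum(z * density) / np.sum(density)

if __name__ == "__main__":
    L = 50.0                                   # bohr, a typical supercell side
    z = np.linspace(0.0, L, 2000, endpoint=False)
    for z0 in (0.30 * L, 0.98 * L):
        rho = periodic_gaussian(z, z0, 0.05 * L, L)
        print(f"true center {z0:6.2f}: phase formula "
              f"{center_from_phase(rho, z, L):6.2f}, "
              f"first moment {center_naive(rho, z):6.2f}")
```

When the density lies well inside the cell the two estimates agree; when it straddles the cell boundary the first moment is meaningless, while the phase formula still returns the center modulo L, the 1d counterpart of the polarization quantum.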

5.4 Correlated wavefunctions 

The treatment given so far assumes an independent-particle scheme, where po- 
larization is evaluated as a Berry phase of one-electron orbitals, typically the 
KS ones. Shortly after the appearance of the King-Smith and Vanderbilt paper 
[90], Ortiz and Martin [91] provided the many-body generalization of the theory, 
where polarization is expressed as a Berry phase of the many-body wavefunc- 
tion. 

A subsequent development [92] provides a unified treatment of macroscopic 
polarization, dealing on the same footing with either independent-electron or 
correlated systems, and with either crystalline or disordered systems. 

5.4.1 Single-point Berry phase again 

Suppose we have $N$ electrons in a cubic box of volume $L^3$, with a many-body
Hamiltonian

$$
\hat{H} = \frac{1}{2m}\sum_{i=1}^{N}\mathbf{p}_i^2 + \hat{V}
\tag{5.14}
$$

and eigenfunctions $|\Psi_n\rangle$, normalized in the hypercube of volume $L^{3N}$. In
Eq. (5.14) the potential $\hat{V}$ includes one-body and two-body (electron-electron)
contributions. As usual in condensed-matter theory, we adopt periodic
Born-von-Karman boundary conditions over each electron coordinate independently,
whose Cartesian components $r_{i,\alpha}$ are then equivalent to the angles
$2\pi r_{i,\alpha}/L$. The potential $\hat{V}$ enjoys the same periodicity, which implies that the
electric field averages to zero over the sample.

The formula for correlated wavefunctions is quite similar to Eq. (5.12), which 
in fact is a special case of the latter. The starting ingredient is the single-point 
Berry phase 

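$$
\gamma_z = \mathrm{Im}\,\log\,\langle\Psi_0|e^{i\frac{2\pi}{L}\sum_i z_i}|\Psi_0\rangle = -\mathrm{Im}\,\log\,\langle\Psi_0|e^{-i\frac{2\pi}{L}\sum_i z_i}|\Psi_0\rangle,
\tag{5.15}
$$

and the polarization formula obtains by replacing $\gamma_z$ in Eq. (5.13), or
equivalently

$$
\mathrm{Im}\,\log\,(\det S)^2 \;\to\; \mathrm{Im}\,\log\,\langle\Psi_0|e^{-i\frac{2\pi}{L}\sum_i z_i}|\Psi_0\rangle.
\tag{5.16}
$$

The electronic term in the polarization is then

$$
P_z^{(\rm el)} = \frac{e}{2\pi L^2}\,\mathrm{Im}\,\log\,\langle\Psi_0|e^{-i\frac{2\pi}{L}\sum_i z_i}|\Psi_0\rangle.
\tag{5.17}
$$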

The proof of Eq. (5.17) is provided in the original paper [92], as well as in some 
review papers [3, 6, 7, 8]. 

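Here we content ourselves with proving that Eq. (5.17) coincides with Eq. (5.12)
in the special case where $|\Psi_0\rangle$ is the Slater determinant of the (doubly
occupied) k = 0 orbitals $|u_j\rangle = |\psi_j\rangle$. The key observation is that the many-body
wavefunction

$$
|\tilde\Psi_0\rangle = e^{-i\frac{2\pi}{L}\sum_i z_i}|\Psi_0\rangle
\tag{5.18}
$$

is the Slater determinant built from the orbitals $|\tilde u_j\rangle = e^{-i\frac{2\pi}{L}z}|u_j\rangle$, hence

$$
\langle\Psi_0|e^{-i\frac{2\pi}{L}\sum_i z_i}|\Psi_0\rangle = \langle\Psi_0|\tilde\Psi_0\rangle = \left(\det\,\langle u_j|\tilde u_{j'}\rangle\right)^2 = (\det S)^2,
\tag{5.19}
$$

where the second power owes to double orbital occupancy.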
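
The determinant identity behind Eq. (5.19) can be verified numerically in a toy setting. The sketch below makes simplifying assumptions that are not in the text: two spinless electrons on a 1d grid (so the determinant enters to the first power, not squared), random orthonormal orbitals, and hypothetical function names. It builds the antisymmetrized two-body wavefunction explicitly and compares the many-body expectation value with the determinant of the one-body overlap matrix.

```python
import numpy as np

def twist_expectations(n_grid=24, L=1.0, seed=0):
    """Return (<Psi|exp(-i 2 pi (z1+z2)/L)|Psi>, det S) for two spinless electrons."""
    rng = np.random.default_rng(seed)
    z = L * np.arange(n_grid) / n_grid
    raw = rng.normal(size=(n_grid, 2)) + 1j * rng.normal(size=(n_grid, 2))
    orbitals, _ = np.linalg.qr(raw)           # two orthonormal columns
    twist = np.exp(-2j * np.pi * z / L)       # one-body factor exp(-i 2 pi z / L)

    # One-body overlap matrix S_jj' = <u_j| exp(-i 2 pi z / L) |u_j'>
    S = orbitals.conj().T @ (twist[:, None] * orbitals)

    # Explicit antisymmetrized wavefunction Psi(z1, z2) on the grid
    u1, u2 = orbitals[:, 0], orbitals[:, 1]
    psi = (np.outer(u1, u2) - np.outer(u2, u1)) / np.sqrt(2.0)
    many_body = np.sum(psi.conj() * np.outer(twist, twist) * psi)
    return many_body, np.linalg.det(S)

if __name__ == "__main__":
    many_body, det_s = twist_expectations()
    print("many-body expectation value:", many_body)
    print("det S                      :", det_s)
```

With doubly occupied orbitals the same argument applies to each spin channel separately, which is where the second power in Eq. (5.19) comes from.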

5.4.2 Kohn-Sham polarization vs. real polarization 

All of the independent-electron formulas discussed in Sec. 5.3 are exact for 
noninteracting electrons, but the obvious aim is to implement them with KS or- 
bitals, in a given density-functional theory (DFT) framework. Since macroscopic 
polarization applies to insulators only, we stress that we mean "KS insulator" 
throughout: that is, we assume that the KS spectrum is gapped. In the class of 
"simple" (i.e. computationally friendly) materials a genuine insulator is also a 
KS insulator, although pathological cases (computationally unfriendly) do exist. 

Having specified this, the key issue is then: Does the KS polarization coincide
with the physical many-body one? The answer is subtle, and differs depending on
whether one chooses "open" boundary conditions, as appropriate for molecules
and clusters, or periodic boundary conditions (Born-von Karman), as invariably
done in the present Notes.

Within open boundary conditions the KS orbitals vanish at infinity, as well 
as the charge density of the sample. P is then the first moment of the charge 
density, divided by the sample volume. The basic tenet of DFT is that the 
microscopic density of the fictitious noninteracting KS system coincides with the 
density of the interacting system. Therefore the exact P coincides by definition 
with the one obtained from the KS orbitals. 

Matters are quite different within periodic boundary conditions: we have 
seen above that P is not a function of the charge density, hence the value of 
P obtained from the KS orbitals, in general, is not the correct many-body P. 
This was first shown in 1995 by Gonze, Ghosez, and Godby [118], and later 
discussed by several authors. A complete account of the issue can be found in 
Ref. [6]. Here we just mention that the exact P is provided by Eq. (5.17), while
the KS P is provided by Eq. (5.12), where the KS orbitals enter Eq. (5.11); both
expressions are to be evaluated in the large-$L$ limit. The two expressions are
clearly different whenever the ground-state wavefunction is not a Slater determinant.

Therefore P cannot be exactly expressed within DFT, but the exact func- 
tional is obviously inaccessible, and even sometimes pathological. The practical 
issue is whether the current popular functionals provide an accurate approxi- 
mation to the experimental values of P in a large class of materials. 

A vast first-principle literature has accumulated over the years by either 
linear-response theory [119] — not reviewed here — or by the modern theory. The 
errors are typically of the order of 10-20% on permittivity, and much less on most
other properties (infrared spectra, piezoelectricity, ferroelectricity) for many 
different materials. It is unclear which part of the error is to be attributed to 
DFT per se, and which part is to be attributed to the approximations to DFT. 
Chapter 6



Quantum metric and the theory of the 
insulating state 



6.1 Nongeometrical theories of the insulating 
state 

The standard textbook approach to the insulating state of matter is based on 
band theory. Following Bloch's theorem [120] in 1928, the main result is due 
to Wilson in 1931 [121]. The single-particle spectrum of a lattice-periodical 
Hamiltonian is in general gapped, and the electron count determines where the 
Fermi level lies. If it crosses a band one has a conductor: an applied electric field 
induces free acceleration of the electrons (at T = 0 in absence of dissipation).
If the Fermi level lies instead in a gap, one has an insulator: in presence of
a field the electronic system polarizes, but no steady-state current flows for
$T \to 0$. This very successful theory explains the insulating/conducting behavior
of most common materials across the periodic table, for which band structure
calculations soon became available.

At the root of band theory are two basic assumptions: the electrons are
noninteracting (in a mean-field sense), and the solid is crystalline. By the early
1960s, however, it became clear that there are solids where these two assump- 
tions are very far from the truth, and where the insulating behavior is due to 
completely different mechanisms. The works of Mott in 1949 [122] and of An- 
derson in 1958 [123] opened new avenues in condensed matter physics. In the 
materials which we now call Mott insulators the insulating behavior is due to 
electron correlation [124], while in those called Anderson insulators it is due to 
lattice disorder [125]. 

In a milestone paper that appeared in 1964, Kohn [126] provided a more
comprehensive characterization of the insulating state of matter, which encompasses
band insulators, Mott insulators, Anderson insulators, and eventually any kind 
of insulating material. According to Kohn, the electrons in the insulating state 
satisfy a many-electron localization condition [127]. This kind of localization 
must be defined in a subtle way given that, for instance, the Hamiltonian eigen- 
states in a band insulator are obviously not localized. According to the original 
Kohn's formulation, the insulating behavior arises whenever the ground-state 
wavefunction of an extended system breaks up into a sum of contributions which 
are localized in essentially disconnected regions of the many-electron configura- 
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tion space. 

Kohn's theory received little attention for many years [128] until the 1990s, 
when a breakthrough occurred in electronic structure theory: the modern theory 
of polarization. Inspired by the fact that electrical polarization discriminates 
qualitatively between insulators and metals, Resta and Sorella [129] in 1999 
provided a definition of many-electron localization rather different from Kohn's, 
deeply rooted in the theory of polarization, and therefore based on geometrical 
concepts. Their program was completed soon after by Souza, Wilkens and 
Martin [130] (hereafter quoted as SWM), thus providing the foundations of the 
modern theory of the insulating state. An early review paper appeared in 2002 
[4], and a recent one in 2011 [9]; the latter is at the root of the present Chapter. 

6.2 Metric-curvature tensor 

As in Sec. 5.4 we address an interacting N-electron system, whose most general 
Hamiltonian we write, in the Schrödinger representation and in Gaussian units, 
as 

$$ \hat H(\boldsymbol\kappa) = \frac{1}{2m_e} \sum_{i=1}^{N} \left[ \mathbf p_i + \frac{e}{c} \mathbf A(\mathbf r_i) + \hbar \boldsymbol\kappa \right]^2 + \hat V. \tag{6.1} $$

Equation (6.1) is exact in the nonrelativistic, infinite-nuclear-mass limit. With 
respect to Eq. (5.14), the velocity is augmented with two terms: $\mathbf A(\mathbf r)$ is a vector 
potential of magnetic origin, and $\boldsymbol\kappa$, having the dimensions of an inverse length, 
is a "flux" or "twist" of the same kind as the single-particle flux thoroughly 
discussed in Sec. 1.6. 

In Sec. 5.4 we adopted periodic Born-von-Kármán boundary conditions 
(PBCs) over a cubic supercell of side L, and the eigenfunctions $|\Psi_n\rangle$ were 
normalized in the hypercube of volume $L^{3N}$. In this Chapter, instead, it is 
expedient to consider in parallel both "open" boundary conditions (OBCs) and PBCs. 
The former are appropriate to molecular physics, and require that the 
many-electron wavefunction of a bound state is square-integrable over the whole 
coordinate space $\mathbb{R}^{3N}$. PBCs are instead appropriate for extended systems, either 
crystalline or disordered, either independent-electron or correlated. It is worth 
emphasizing that the choice of boundary conditions (either OBCs or PBCs) 
corresponds to choosing the Hilbert space, which in turn affects profoundly the 
geometrical properties discussed in Ch. 3. 

6.2.1 Open boundary conditions 

The case of OBCs is by far the simplest. For the sake of simplicity we write 
the ground state of Eq. (6.1) at $\boldsymbol\kappa = 0$ as $|\Psi_0\rangle = |\Psi_0(0)\rangle$, and we define the 
many-body position operator as 

$$ \hat{\mathbf r} = \sum_{i=1}^{N} \mathbf r_i. \tag{6.2} $$

As in Sec. 1.6.2 the flux is easily "gauged away": the state $e^{-i\boldsymbol\kappa\cdot\hat{\mathbf r}}|\Psi_0\rangle$ coincides 
with the ground eigenstate $|\Psi_0(\boldsymbol\kappa)\rangle$ of the twisted Hamiltonian, Eq. (6.1). It 
is legitimate to multiply this eigenvector by any $\boldsymbol\kappa$-dependent (and 
position-independent) phase factor; our choice is then 

$$ |\Psi_0(\boldsymbol\kappa)\rangle = e^{-i\boldsymbol\kappa\cdot(\hat{\mathbf r} - \mathbf d)}\, |\Psi_0\rangle, \tag{6.3} $$

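where $\mathbf d = \langle\Psi_0|\hat{\mathbf r}|\Psi_0\rangle$ is the electronic dipole of the molecular system. It follows 
that the $\boldsymbol\kappa$-derivative needed in our metric-curvature tensor, Eq. (3.24), is 

$$ |\nabla_{\boldsymbol\kappa}\Psi_0\rangle = -i\,(\hat{\mathbf r} - \mathbf d)\,|\Psi_0\rangle = -i\,\hat Q(0)\,\hat{\mathbf r}\,|\Psi_0\rangle. \tag{6.4} $$

Exploiting the idempotency of the projector, the tensor $\mathcal F_{\alpha\beta}$, evaluated at $\boldsymbol\kappa = 0$, 
is 

$$ \mathcal F_{\alpha\beta}(0) = \langle\Psi_0|\hat r_\alpha \hat Q(0) \hat r_\beta|\Psi_0\rangle = \langle\Psi_0|\hat r_\alpha \hat r_\beta|\Psi_0\rangle - \langle\Psi_0|\hat r_\alpha|\Psi_0\rangle\langle\Psi_0|\hat r_\beta|\Psi_0\rangle, \tag{6.5} $$

where $\hat Q$ is the projector over the unoccupied many-body eigenstates. $\mathcal F(0)$ is 
clearly a real symmetric tensor, hence the Berry curvature vanishes within 
OBCs, and the metric $g_{\alpha\beta}(0)$ coincides with $\mathcal F_{\alpha\beta}(0)$. It will be shown that 
within PBCs, instead, the tensor $\mathcal F_{\alpha\beta}(0)$ may acquire a nonvanishing imaginary 
part. 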
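
For the reader's convenience, the step leading to Eq. (6.5) can be spelled out as follows. This is only a sketch: it assumes that the metric-curvature tensor has the form $\langle\partial_\alpha\Psi_0|\hat Q|\partial_\beta\Psi_0\rangle$, as in Eq. (6.7), and that $\hat Q(0) = 1 - |\Psi_0\rangle\langle\Psi_0|$ for a nondegenerate ground state.

```latex
\begin{align*}
|\partial_\beta \Psi_0\rangle &= -i\,\hat Q(0)\,\hat r_\beta\,|\Psi_0\rangle ,
\qquad
\langle \partial_\alpha \Psi_0| = +i\,\langle\Psi_0|\,\hat r_\alpha\,\hat Q(0) , \\
\mathcal F_{\alpha\beta}(0)
 &= \langle \partial_\alpha \Psi_0 |\,\hat Q(0)\,| \partial_\beta \Psi_0 \rangle
  = \langle \Psi_0 |\,\hat r_\alpha\,\hat Q(0)\,\hat Q(0)\,\hat Q(0)\,\hat r_\beta\,| \Psi_0 \rangle \\
 &= \langle \Psi_0 |\,\hat r_\alpha\,\hat Q(0)\,\hat r_\beta\,| \Psi_0 \rangle
  \qquad \text{(idempotency: } \hat Q^2 = \hat Q \text{)} \\
 &= \langle \Psi_0 |\,\hat r_\alpha \hat r_\beta\,| \Psi_0 \rangle
  - \langle \Psi_0 |\,\hat r_\alpha\,| \Psi_0 \rangle \langle \Psi_0 |\,\hat r_\beta\,| \Psi_0 \rangle .
\end{align*}
```

Since $\hat r_\alpha$ and $\hat r_\beta$ commute and are Hermitian, the last line is manifestly real and symmetric in $\alpha\beta$.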

Clearly, the metric tensor in Eq. (6.5) is the cumulant second moment of the 
position operator, or equivalently the ground-state fluctuation of the dipole of 
the molecular system; this quantity is extensive (it scales like N in macroscopically 
homogeneous systems). We anticipate that $\mathcal F_{\alpha\beta}(0)/N$ discriminates, in the 
large-N limit, between insulators and metals. 

6.2.2 Periodic boundary conditions 

First of all, a key observation about the position operator is in order. The simple 
multiplicative operator $\hat{\mathbf r}$, as defined in Eq. (6.2), is "forbidden" within PBCs: 
in fact it maps any periodic wavefunction $|\Psi\rangle$ into the nonperiodic function 
$\hat{\mathbf r}|\Psi\rangle$. In other words, it maps a function which belongs to the Hilbert space 
to a function outside of it [92]. Incidentally, this is the main reason why the 
polarization problem remained unsolved until the early 1990s. In the present 
context, formulae like Eq. (6.5) are ill defined and absurd within PBCs. 

Within PBCs the state $e^{-i\boldsymbol\kappa\cdot\hat{\mathbf r}}|\Psi_0\rangle$ is not an eigenstate of Eq. (6.1), except 
for the discrete $\boldsymbol\kappa$ values 

$$ \boldsymbol\kappa_{m_1 m_2 m_3} = \frac{2\pi}{L}\,(m_1 \mathbf e_1 + m_2 \mathbf e_2 + m_3 \mathbf e_3), \tag{6.6} $$

where $m_\alpha \in \mathbb{Z}$, and $\mathbf e_\alpha$ are the Cartesian unit vectors. For fractional values of $\boldsymbol\kappa$, i.e. 
for values different from those in Eq. (6.6), the $\boldsymbol\kappa$-dependence of the eigenvalues 
and eigenvectors of Eq. (6.1) is nontrivial. We have thoroughly discussed the 
single-particle version of this feature in Sec. 1.6.3. 

If $|\Psi_0(\boldsymbol\kappa)\rangle$ is the genuine ground eigenstate of Eq. (6.1) within PBCs, then 
the auxiliary function $|\tilde\Psi_0(\boldsymbol\kappa)\rangle = e^{i\boldsymbol\kappa\cdot\hat{\mathbf r}}|\Psi_0(\boldsymbol\kappa)\rangle$ is an eigenfunction of $\hat H(0)$, but 
fulfills quasi-periodic boundary conditions: at any two opposite faces of the cube 
the wavefunction differs by a $\boldsymbol\kappa$-dependent phase factor. In other words the 
problem can be formulated in two equivalent ways: either the Hamiltonian is 
$\boldsymbol\kappa$-dependent, as in Eq. (6.1), and the boundary conditions are $\boldsymbol\kappa$-independent; 
or the Hamiltonian is $\boldsymbol\kappa$-independent but the boundary conditions are "twisted" 
in a $\boldsymbol\kappa$-dependent way. 
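
The role of the discrete values in Eq. (6.6) can be illustrated on a toy model. The sketch below is not taken from the text: it assumes a single particle hopping on a one-dimensional ring (a tight-binding analogue of Eq. (6.1)), where the twist enters as a phase on each hop and the eigenvalues are known in closed form. The function name `ring_spectrum` and all parameter values are hypothetical.

```python
import math

def ring_spectrum(n_sites, kappa, t=1.0, a=1.0):
    """Sorted eigenvalues of one particle on a ring threaded by a twist kappa.

    Assumed toy model: nearest-neighbour hopping -t with a phase kappa*a per
    hop, whose plane-wave eigenvalues are
    E_n = -2 t cos(2 pi n / n_sites + kappa a).
    """
    return sorted(
        -2.0 * t * math.cos(2.0 * math.pi * n / n_sites + kappa * a)
        for n in range(n_sites)
    )

n_sites, a = 8, 1.0
L = n_sites * a                                        # ring circumference
e_zero = ring_spectrum(n_sites, 0.0)                   # untwisted spectrum
e_quantum = ring_spectrum(n_sites, 2.0 * math.pi / L)  # allowed value, Eq. (6.6)
e_half = ring_spectrum(n_sites, math.pi / L)           # fractional value

# At kappa = 2 pi / L the twist is gauged away: the spectrum is unchanged.
same_spectrum = all(abs(x - y) < 1e-12 for x, y in zip(e_zero, e_quantum))
print("kappa = 2 pi / L reproduces the kappa = 0 spectrum:", same_spectrum)
# At a fractional kappa the ground-state energy is genuinely different.
print("ground energy at kappa = 0      :", e_zero[0])
print("ground energy at kappa = pi / L :", e_half[0])
```

In this toy model the spectrum as a whole is periodic in the twist with period 2π/L, while at fractional values the eigenvalues depend on the twist in a nontrivial way.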

Within PBCs the tensor $\mathcal F_{\alpha\beta}(0)$ cannot be simplified, as was done e.g. in Eq. (6.5), 
and must therefore be addressed in its original form by actually evaluating the 
$\boldsymbol\kappa$-derivatives: 

$$ \mathcal F_{\alpha\beta}(\boldsymbol\kappa) = \langle \partial_\alpha \Psi_0(\boldsymbol\kappa) |\, \hat Q(\boldsymbol\kappa)\, | \partial_\beta \Psi_0(\boldsymbol\kappa) \rangle. \tag{6.7} $$

As within OBCs, this tensor is extensive within PBCs as well. In the 
thermodynamic limit (i.e. N → ∞, keeping $N/L^3$ constant), the well defined quantity is 
$\mathcal F_{\alpha\beta}(0)/N$. We also observe that for time-reversal symmetric systems $\mathcal F_{\alpha\beta}(0)$ is 
real symmetric, and therefore coincides with the metric tensor $g_{\alpha\beta}(0)$. 

6.2.3 Sum over states again 

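We express $\langle r_\alpha r_\beta\rangle_c$, i.e. $\mathcal F_{\alpha\beta}(0)/N$ as in Eq. (6.11), using the sum-over-states 
formula, Eq. (3.28): 

$$ \langle r_\alpha r_\beta\rangle_c = \frac{1}{N} \sum_{n\neq0} \frac{\langle\Psi_0|\partial_{\kappa_\alpha}\hat H|\Psi_n\rangle \langle\Psi_n|\partial_{\kappa_\beta}\hat H|\Psi_0\rangle}{(E_0 - E_n)^2}. \tag{6.8} $$

The $\boldsymbol\kappa$-derivative of the Hamiltonian of Eq. (6.1) is 

$$ \nabla_{\boldsymbol\kappa} \hat H(0) = \frac{\hbar}{m_e} \sum_{i=1}^{N} \left[ \mathbf p_i + \frac{e}{c} \mathbf A(\mathbf r_i) \right], \tag{6.9} $$

where the rhs is nothing else than $\hbar$ times the velocity operator $\hat{\mathbf v}$; Eq. (6.8) 
then becomes 

$$ \langle r_\alpha r_\beta\rangle_c = \frac{\hbar^2}{N} \sum_{n\neq0} \frac{\langle\Psi_0|\hat v_\alpha|\Psi_n\rangle \langle\Psi_n|\hat v_\beta|\Psi_0\rangle}{(E_0 - E_n)^2} = \frac{1}{N} \sum_{n\neq0} \frac{\langle\Psi_0|\hat v_\alpha|\Psi_n\rangle \langle\Psi_n|\hat v_\beta|\Psi_0\rangle}{\omega_{0n}^2}, \tag{6.10} $$

where $\omega_{0n} = (E_n - E_0)/\hbar$. 

The velocity operator is also commonly expressed as $\hat{\mathbf v} = i[\hat H(0), \hat{\mathbf r}]/\hbar$, but 
it is worth emphasizing that while the position $\hat{\mathbf r}$ is well defined within OBCs 
and ill defined within PBCs, the velocity $\hat{\mathbf v}$ is well defined in both cases. 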

The basic sum-over-states formula, Eq. (6.10), therefore applies to both 
OBCs and PBCs. In general, it is not very useful on practical grounds, since it 
would require the evaluation of slowly convergent sums. Nonetheless, Eq. (6.10) 
is essential for understanding the physical meaning of $\mathcal F_{\alpha\beta}$, 
as will be shown below. 
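
Within OBCs the sum-over-states expression and the position fluctuation of Eq. (6.5) must coincide, and this can be checked numerically on a small example. The sketch below is an illustration only, under assumptions that are not in the text: a single electron (N = 1) on an open tight-binding chain with uniform hopping, unit lattice spacing and ħ = 1, whose eigenstates are known in closed form. All function names are hypothetical.

```python
import math

def open_chain_states(n_sites, t=1.0):
    """Exact eigenpairs of a uniform open tight-binding chain (hopping -t)."""
    norm = math.sqrt(2.0 / (n_sites + 1))
    energies, vectors = [], []
    for n in range(1, n_sites + 1):
        q = math.pi * n / (n_sites + 1)
        energies.append(-2.0 * t * math.cos(q))       # increasing with n
        vectors.append([norm * math.sin(q * j) for j in range(1, n_sites + 1)])
    return energies, vectors

def position_fluctuation(psi):
    """<x^2> - <x>^2 with x_j = j, i.e. Eq. (6.5) for one electron in 1d."""
    mean = sum(p * p * (j + 1) for j, p in enumerate(psi))
    mean_sq = sum(p * p * (j + 1) ** 2 for j, p in enumerate(psi))
    return mean_sq - mean * mean

def velocity_element(psi_a, psi_b, t=1.0):
    """<a| v |b> with v = i [H, x] (hbar = 1) for real eigenvectors."""
    total = 0.0
    for j in range(len(psi_a) - 1):
        # v_{j,j+1} = -i t and v_{j+1,j} = +i t for hopping -t, unit spacing
        total += t * (psi_a[j + 1] * psi_b[j] - psi_a[j] * psi_b[j + 1])
    return 1j * total

def sum_over_states(energies, vectors, t=1.0):
    """Rhs of Eq. (6.10) for N = 1: sum over n != 0 of |<0|v|n>|^2 / omega_0n^2."""
    e0, psi0 = energies[0], vectors[0]
    return sum(
        abs(velocity_element(psi0, psi_n, t)) ** 2 / (e_n - e0) ** 2
        for e_n, psi_n in zip(energies[1:], vectors[1:])
    )

energies, vectors = open_chain_states(10)
lhs = position_fluctuation(vectors[0])    # ground-state fluctuation
rhs = sum_over_states(energies, vectors)  # excitation-based expression
print("fluctuation     :", lhs)
print("sum over states :", rhs)
```

The two numbers agree to rounding error because, in a finite basis, the velocity matrix is exactly i[H, x], so that each term of the sum reduces to the squared modulus of an off-diagonal position matrix element.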



6.3 Geometrical theory of the insulating state 
6.3.1 Fundamentals 

The modern formulation of the theory of the insulating state is based on a 
localization tensor (squared localization length in 1d), first introduced by Resta 
and Sorella in 1999 [129]. This work was followed soon afterwards by SWM 
[130], where additional results are found, most notably relating localization to 
conductivity. Since then, most authors (including the present one) have adopted 
the notation $\langle r_\alpha r_\beta\rangle_c$ for the localization tensor, where "c" stands for cumulant. 
The formulation within OBCs, using the language and the notations familiar in 
quantum chemistry, dates from 2006 [106]. 

The tensor $\langle r_\alpha r_\beta\rangle_c$ has the dimensions of a squared length; it is an 
intensive quantity that characterizes the ground-state many-body wavefunction as a 
whole. Its key virtue is that it discriminates between insulators and metals: it is 
finite in the former case and divergent (in the large-system limit) in the latter. 
This is the main message of the present Chapter (and of the modern theory of 
the insulating state): we are going to prove it below, in Sect. 6.4. 

Several expressions for the localization tensor have been given in the 
literature, all of them equivalent; here we define the localization tensor as the 
intensive quantity 

$$ \langle r_\alpha r_\beta\rangle_c = \mathcal F_{\alpha\beta}(0)/N, \tag{6.11} $$

where the thermodynamic limit is understood. A glance at the OBCs expression, 
Eq. (6.5), shows the cumulant nature of $\mathcal F$ and explains the reason for the 
notation, which we adopt within PBCs as well. 

Until 2005 the theory of the insulating state implicitly addressed 
time-reversal invariant systems only, where the $\mathcal F$ tensor is real symmetric, and 
coincides with the metric: in such systems therefore 

$$ \langle r_\alpha r_\beta\rangle_c = g_{\alpha\beta}(0)/N. \tag{6.12} $$

It was found in 2005 [131] that, in the absence of time-reversal symmetry and 
within PBCs, the tensor $\langle r_\alpha r_\beta\rangle_c$ is naturally endowed with an antisymmetric 
imaginary part, whose physical meaning is also remarkable. This is discussed 
in Sects. ?? and ??. 



6.3.2 Linear response 

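So far, we have only discussed ground-state properties of our N-electron 
system. Suppose now that it is subject to a small time-dependent perturbation 
contributing to the Hamiltonian the term: 

$$ \delta\hat H(t) = \int d\omega\, f(\omega)\, \frac{1}{2} \left( \hat A\, e^{-i\omega t} + \hat A^\dagger e^{i\omega t} \right), \tag{6.13} $$

where $\hat A$ determines the "shape" of the perturbation and $f$ its amplitude. In 
order to get a Hermitian $\delta\hat H$, we assume $f(\omega) = f(-\omega)$. We wish to measure 
the response to such perturbation by means of the expectation value of some 
observable $\hat B$, i.e.: 

$$ \delta\langle\hat B\rangle = \langle\Psi|\hat B|\Psi\rangle - \langle\Psi_0|\hat B|\Psi_0\rangle, \tag{6.14} $$

where $\Psi = \Psi_0 + \delta\Psi(t)$ is the perturbed time-evolved ground state. If we limit 
ourselves to terms which are linear in the perturbation, it is enough to consider 
the single oscillatory perturbation: 

$$ \hat H'(\omega) = \frac{1}{2} \left( \hat A\, e^{-i\omega t} + \hat A^\dagger e^{i\omega t} \right), \tag{6.15} $$

whose response can be written, using the compact notations due to Zubarev 
[64, 132, 133], as: 

$$ \delta\langle\hat B\rangle = \frac{1}{2} \left( \langle\langle \hat B|\hat A \rangle\rangle_{\omega}\, e^{-i\omega t} + \langle\langle \hat B|\hat A^\dagger \rangle\rangle_{-\omega}\, e^{i\omega t} \right). \tag{6.16} $$

The quantity $\langle\langle \hat B|\hat A \rangle\rangle_{\omega}$ is by definition the linear response induced by the 
perturbation $\hat A$ at frequency $\omega$ on the expectation value $\langle\hat B\rangle$. Straightforward 
first-order perturbation theory provides its explicit expression as: 

$$ \langle\langle \hat B|\hat A \rangle\rangle_{\omega} = \frac{1}{\hbar} \lim_{\eta\to0^+} \sum_{n\neq0} \left( \frac{\langle\Psi_0|\hat B|\Psi_n\rangle \langle\Psi_n|\hat A|\Psi_0\rangle}{\omega - \omega_{0n} + i\eta} - \frac{\langle\Psi_0|\hat A|\Psi_n\rangle \langle\Psi_n|\hat B|\Psi_0\rangle}{\omega + \omega_{0n} + i\eta} \right), \tag{6.17} $$

where $\omega_{0n}$ are the excitation frequencies of the unperturbed system, and the 
positive infinitesimal $\eta$ ensures causality. Expressions of the kind of Eq. (6.17) 
go under the name of Kubo formulae. 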

6.3.3 Conductivity 

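The conductivity tensor $\sigma_{\alpha\beta}(\omega)$ measures the current linearly induced by an 
electric field: $j_\alpha = \sigma_{\alpha\beta}\,\mathcal E_\beta$. We therefore identify $\hat A$ with the potential of an 
electric field along $\beta$, i.e. $\hat A = e\,\mathcal E\,\hat r_\beta$, and $\hat B$ with the current operator $-e\,\hat v_\alpha/L^3$. 

An important detail must be stressed at this point. The macroscopic field 
inside the sample includes by definition screening effects due to the electronic 
system, while the perturbation $\delta\hat H$ entering Eq. (6.13), via the $\hat A$ operator, is 
the "bare", or unscreened one. This point will be discussed below (Sect. 6.3.5); 
for the time being we simply identify screened and unscreened fields. 

The Kubo formula for conductivity is therefore 

$$ \sigma_{\alpha\beta}(\omega) = -\frac{e^2}{L^3}\, \langle\langle \hat v_\alpha|\hat r_\beta \rangle\rangle_{\omega}; \tag{6.18} $$

this is correct within OBCs, but meaningless within PBCs, owing to the explicit 
presence of the position operator. This, however, does no harm, since only 
its off-diagonal matrix elements are required: see Eq. (6.17). As usual, we may 
exploit the identity $\langle\Psi_0|\hat{\mathbf r}|\Psi_n\rangle = i\,\langle\Psi_0|\hat{\mathbf v}|\Psi_n\rangle/\omega_{0n}$, which follows from 
$\hat{\mathbf v} = i[\hat H(0), \hat{\mathbf r}]/\hbar$. The Kubo formula then becomes 

$$ \sigma_{\alpha\beta}(\omega) = \frac{i e^2}{\hbar L^3} \lim_{\eta\to0^+} \sum_{n\neq0} \frac{1}{\omega_{0n}} \left( \frac{\langle\Psi_0|\hat v_\alpha|\Psi_n\rangle \langle\Psi_n|\hat v_\beta|\Psi_0\rangle}{\omega - \omega_{0n} + i\eta} + \frac{\langle\Psi_0|\hat v_\beta|\Psi_n\rangle \langle\Psi_n|\hat v_\alpha|\Psi_0\rangle}{\omega + \omega_{0n} + i\eta} \right). \tag{6.19} $$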

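We introduce a compact notation for the real and imaginary parts of the 
numerators in Eq. (6.19), i.e. 

$$ \mathcal R_{n,\alpha\beta} = \mathrm{Re}\, \langle\Psi_0|\hat v_\alpha|\Psi_n\rangle \langle\Psi_n|\hat v_\beta|\Psi_0\rangle, \tag{6.20} $$
$$ \mathcal I_{n,\alpha\beta} = \mathrm{Im}\, \langle\Psi_0|\hat v_\alpha|\Psi_n\rangle \langle\Psi_n|\hat v_\beta|\Psi_0\rangle, \tag{6.21} $$

which are symmetric and antisymmetric, respectively. Using then 

$$ \lim_{\eta\to0^+} \frac{1}{x + i\eta} = \mathcal P\, \frac{1}{x} - i\pi\,\delta(x), \tag{6.22} $$

and omitting the principal part, we separate for $\omega > 0$ the symmetric and 
antisymmetric parts in the conductivity tensor as 

$$ \mathrm{Re}\, \sigma^{(+)}_{\alpha\beta}(\omega) = \frac{\pi e^2}{\hbar L^3} \sum_{n\neq0} \frac{\mathcal R_{n,\alpha\beta}}{\omega_{0n}}\, \delta(\omega - \omega_{0n}), \qquad \mathrm{Im}\, \sigma^{(-)}_{\alpha\beta}(\omega) = \frac{\pi e^2}{\hbar L^3} \sum_{n\neq0} \frac{\mathcal I_{n,\alpha\beta}}{\omega_{0n}}\, \delta(\omega - \omega_{0n}). \tag{6.23} $$

6.3.4 Sum rules 

At this point, we are ready to compare with the sum-over-states formulae for 
the tensor $\langle r_\alpha r_\beta\rangle_c$. In the present notations, we rewrite Eq. (6.10) as 

$$ \mathrm{Re}\, \langle r_\alpha r_\beta\rangle_c = \frac{1}{N} \sum_{n\neq0} \frac{\mathcal R_{n,\alpha\beta}}{\omega_{0n}^2}, \qquad \mathrm{Im}\, \langle r_\alpha r_\beta\rangle_c = \frac{1}{N} \sum_{n\neq0} \frac{\mathcal I_{n,\alpha\beta}}{\omega_{0n}^2}. \tag{6.24} $$

A glance at Eq. (6.23) shows that 

$$ \mathrm{Re}\, \langle r_\alpha r_\beta\rangle_c = \frac{\hbar L^3}{\pi e^2 N} \int_0^\infty \frac{d\omega}{\omega}\, \mathrm{Re}\, \sigma^{(+)}_{\alpha\beta}(\omega), \tag{6.25} $$

$$ \mathrm{Im}\, \langle r_\alpha r_\beta\rangle_c = \frac{\hbar L^3}{\pi e^2 N} \int_0^\infty \frac{d\omega}{\omega}\, \mathrm{Im}\, \sigma^{(-)}_{\alpha\beta}(\omega) = \frac{\hbar L^3}{2 e^2 N}\, \mathrm{Re}\, \sigma^{(-)}_{\alpha\beta}(0), \tag{6.26} $$

where the last equality in Eq. (6.26) follows from the Kramers-Kronig relation 
for the antisymmetric part of the conductivity. 

Eq. (6.25) was arrived at by SWM in 2000 [130], and Eq. (6.26) by Resta 
in 2005 [131]. First of all, these identities show that $\langle r_\alpha r_\beta\rangle_c$, defined here as a 
basic geometric feature, is indeed a measurable quantity (whenever it does not 
diverge). 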

As emphasized throughout this work, $\langle r_\alpha r_\beta\rangle_c$ is a ground-state property, 
while the rhs of Eqs. (6.25) and (6.26) are properties of the system excitations, 
owing to the Kubo formula. Indeed, both Eqs. (6.25) and (6.26) look like the 
zero-temperature limit of a fluctuation-dissipation theorem, several forms of 
which are known in statistical physics [134, 135]: in the lhs we have a 
ground-state fluctuation (see in particular Eq. (6.5)), while the ingredient of the rhs is 
conductivity (dissipation). 



6.3.5 Screened vs. unscreened field 

The Kubo formula for conductivity has been obtained identifying the $\hat A$ operator 
with $e\,\mathcal E\,\hat r_\beta$, where $\mathcal E$ is the macroscopic field inside the sample. In general this 
is not quite correct, since instead $\hat A = e\,\mathcal E_0\,\hat r_\beta$, where $\mathcal E_0$ is the "bare" field, i.e. 
the field that would be present inside the sample in the absence of screening. The 
latter originates from the two-body (electron-electron) terms in the potential $\hat V$ 
entering the Schrödinger equation. The relationship between $\mathcal E$ and $\mathcal E_0$ is not a bulk 
property, and depends on the shape of the sample. Alternatively, it depends on 
the boundary conditions assumed for integrating the Poisson equation: we refer to 
Ref. [8] for a thorough discussion. Whenever $\mathcal E \neq \mathcal E_0$, the sum rule in Eq. (6.25) 
must be modified. 

The ground-state fluctuations, as e.g. in Eq. (6.5), are fluctuations of the 
macroscopic polarization, which in a finite sample induce a surface charge at 
the boundary. This in turn generates a depolarizing field, which counteracts 
polarization. Therefore the localization tensor $\langle r_\alpha r_\beta\rangle_c$ depends on the sample 
shape, or equivalently on the boundary conditions assumed when taking the 
thermodynamic limit. The choice of PBCs in Eq. (6.1), however, implies $\mathcal E = \mathcal E_0$ 
[136]; hence Eqs. (6.25) and (6.26) are correct as they stand within PBCs. This 
no longer holds within OBCs: in this case Eq. (6.25) needs to be modified, while 
$\mathrm{Im}\, \langle r_\alpha r_\beta\rangle_c = 0$. 

Ideally the equality $\mathcal E = \mathcal E_0$ corresponds to choosing a sample in the form 
of a slab, and to addressing the component of the fluctuation tensor $\langle r_\alpha r_\beta\rangle_c$ 
parallel to the slab [8]; the thermodynamic limit amounts then to the infinite 
slab thickness. Owing to the long range of the Coulomb interaction the order of 
the limits (first a slab, then its infinite thickness) is crucial. For instance, if 
the limit is taken instead by considering spherical clusters of increasing radius, 
the SWM fluctuation-dissipation sum rule, Eq. (6.25), assumes a different form: 
this is discussed in Refs. [106, 136]. The explicit form of the generalized sum 
rule is given therein. 

Last but not least, the effect leading to $\mathcal E \neq \mathcal E_0$ within OBCs is a pure 
correlation effect. It originates from explicitly correlated wavefunctions, and 
does not occur within mean-field theories (Hartree-Fock and Kohn-Sham) [106, 
136]. Within such theories, therefore, the sum rules hold in the simple form 
of Eqs. (6.25) and (6.26); the conductivity therein is the independent-particle 
conductivity ("uncoupled" response in quantum-chemistry jargon). 

6.4 Localization in the insulating state 

The basic tenet of the modern theory of the insulating state is that the 
localization tensor $\langle r_\alpha r_\beta\rangle_c$ is the ground-state property which sharply discriminates, 
in the spirit of Kohn's seminal work [126, 127], between insulators and metals. 
The real part of $\langle r_\alpha r_\beta\rangle_c$ remains finite in the thermodynamic limit in any 
insulator, while it diverges in any metal. 

The theory is very general, and has found applications to various different 
kinds of insulators: band insulators [137, 138, 139, 140]; correlated (i.e. Mott) 
insulators, either by means of Hubbard-like model Hamiltonians [129, 141] or 
realistic ones [142, 143]; and Anderson insulators [144]. Some of these 
applications are reviewed in Sect. 6.5. The cases of Chern insulators, quantum Hall 
insulators, and topological insulators are discussed in Chap. ??. 

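The ultimate proof of the key property of $\mathrm{Re}\, \langle r_\alpha r_\beta\rangle_c$ is based on the SWM 
sum rule, Eq. (6.25). Since the tensor is real symmetric, it is enough to consider 
the diagonal elements (over its principal axes) 

$$ \langle r_\alpha r_\alpha\rangle_c = \frac{\hbar L^3}{\pi e^2 N} \int_0^\infty \frac{d\omega}{\omega}\, \mathrm{Re}\, \sigma_{\alpha\alpha}(\omega). \tag{6.27} $$

The f-sum rule yields 

$$ \int_0^\infty d\omega\, \mathrm{Re}\, \sigma_{\alpha\alpha}(\omega) = \frac{\omega_p^2}{8} = \frac{\pi e^2 N}{2 m_e L^3}, \tag{6.28} $$

where $\omega_p$ is the plasma frequency. Therefore the integral in Eq. (6.27) always 
converges at ∞; its convergence/divergence is dominated by the small-$\omega$ 
behavior of $\mathrm{Re}\, \sigma_{\alpha\alpha}(\omega)$. 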

Suppose first that the spectrum is gapped, i.e. the spacing between the 
ground state and the first excited state stays finite in the thermodynamic limit. 
If the gap is $E_g$ the conductivity vanishes for $\omega < E_g/\hbar$, hence $1/\omega \le \hbar/E_g$ 
wherever the integrand of Eq. (6.27) is nonzero, and Eq. (6.28) yields 

$$ \langle r_\alpha r_\alpha\rangle_c \le \frac{\hbar^2}{2 m_e E_g}. \tag{6.29} $$

This inequality is due to SWM and clearly proves that $\mathrm{Re}\, \langle r_\alpha r_\beta\rangle_c$ is finite in 
any gapped insulator, as e.g. band insulators (considered in more detail in Sect. 
6.4.2). 

The main message of Kohn's 1964 paper, however, is that "insulating 
characteristics are a strict consequence of electronic localization (in an appropriate 
sense) and do not require an energy gap". For any gapless material, the small-$\omega$ 
behavior of $\mathrm{Re}\, \sigma_{\alpha\alpha}(\omega)$ is the result of a competition between numerators and 
denominators in the Kubo formula, Eq. (6.19). Since we aim at a continuous 
function of $\omega$, the singularities in Eq. (6.23) must be smoothed: this can be done 
by keeping the "dissipation" $\eta$ finite while performing the thermodynamic limit 
first [145]. For a band metal the localization tensor diverges (see below). 
According to SWM, a gapless material is insulating whenever $\mathrm{Re}\, \sigma_{\alpha\alpha}(\omega) \to 0$ like 
a positive power of $\omega$, and metallic otherwise. The only example of a gapless 
insulator considered so far is a model Anderson insulator in 1d [144]. Simulations 
prove indeed that $\langle x^2\rangle_c$ is finite therein (Sect. 6.5.4). 

6.4.1 Independent electrons 

For noninteracting electrons the potential $\hat V$ in Eq. (6.1) is the sum of identical 
one-body terms: $\hat V = \sum_{i=1}^{N} V(\mathbf r_i)$. The many-electron Hamiltonian is separable 
and the exact ground state $|\Psi_0(\boldsymbol\kappa)\rangle$ is a Slater determinant of one-particle 
orbitals (doubly occupied in the singlet case). At a mean-field level, the 
one-body potential $V(\mathbf r)$ includes electron-electron interaction in a selfconsistent 
way. In the Hartree-Fock (HF) framework the Slater determinant is regarded as 
an approximate many-electron wavefunction. Instead, in the density-functional 
framework the orbitals, called Kohn-Sham (KS) orbitals, are auxiliary 
quantities, individually devoid of physical meaning. In particular, their Slater 
determinant does not coincide with the many-electron wavefunction $|\Psi_0(\boldsymbol\kappa)\rangle$, as a 
matter of principle. Therefore the exact localization tensor does not coincide, 
at least in principle, with the one obtained from the Slater determinant of KS 
orbitals; the issue is similar to the one discussed above for macroscopic 
polarization P (Sec. 5.4.2). We stress, however, a difference: the exact P coincides with 
the KS P within OBCs, and the problem occurs only within PBCs. Instead, 
the exact localization tensor differs in principle from the KS one within both 
OBCs and PBCs. 

So much for the matters of principle. On practical grounds such difference 
is routinely disregarded (e.g. when dealing with polarization, magnetization 
[5, 6, 8], and more), given that it is not at all clear what is the relative 
importance of this "intrinsic" error, compared with the errors due to the choice 
of the functional itself. We therefore address here either the HF or the KS 
wavefunction $|\Psi_0(\boldsymbol\kappa)\rangle$, having the form of a Slater determinant. 

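Whenever the wavefunction is a Slater determinant, all ground-state 
properties can be explicitly cast in terms of the one-body density matrix 

$$ \rho(\mathbf r, \mathbf r') = 2 P(\mathbf r, \mathbf r') = 2 \sum_{j=1}^{N/2} \varphi_j(\mathbf r)\, \varphi_j^*(\mathbf r'), \tag{6.30} $$

where a singlet ground state is assumed, and $\varphi_j(\mathbf r)$ are the occupied one-particle 
orbitals (either HF or KS); $P(\mathbf r, \mathbf r')$ is the projector over the occupied manifold. 

The expression for the localization tensor is easily found within OBCs 
starting from Eq. (6.5) [106]: 

$$ \langle r_\alpha r_\beta\rangle_c = \frac{1}{N} \int d\mathbf r\, d\mathbf r'\, (\mathbf r - \mathbf r')_\alpha (\mathbf r - \mathbf r')_\beta\, |P(\mathbf r, \mathbf r')|^2. \tag{6.31} $$

If we define the complementary projector 

$$ Q(\mathbf r, \mathbf r') = \delta(\mathbf r - \mathbf r') - P(\mathbf r, \mathbf r'), \tag{6.32} $$

an equivalent expression is [137] 

$$ \langle r_\alpha r_\beta\rangle_c = \frac{2}{N}\, \mathrm{Tr}\, \{ r_\alpha P\, r_\beta\, Q \}, \tag{6.33} $$

where "Tr" is the trace over the single-particle Hilbert space (not on Cartesian 
indices). 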

If we consider a cluster, cut out of a crystalline solid, Eq. (6.31) becomes in 
the large-N limit 

$$ \langle r_\alpha r_\beta\rangle_c = \frac{1}{N_c} \int_{\rm cell} d\mathbf r \int_{\rm all\ space} d\mathbf r'\, (\mathbf r - \mathbf r')_\alpha (\mathbf r - \mathbf r')_\beta\, |P(\mathbf r, \mathbf r')|^2, \tag{6.34} $$

where $N_c$ is the number of electrons per crystal cell. According to the discussion 
in Sec. 6.3.5, we need not worry about shape issues in taking the limit; we 
also notice that the density matrix, Eq. (6.30), is independent of the boundary 
conditions (either OBCs or PBCs) in the large-N limit. 

6.4.2 Band insulators and band metals 

As observed, Eq. (6.34) holds for crystalline solids. Therefore the inner integral 
on the rhs must converge for a band insulator, and must diverge for a band metal. 
This is confirmed by the well known fact that the asymptotic behavior of P is 
qualitatively different in insulators and in metals. In the former materials, 
in fact, $P(\mathbf r, \mathbf r')$ decays exponentially [146, 147, 148, 149] for large values of 
$|\mathbf r - \mathbf r'|$: therefore the integral converges and the localization tensor is finite. 
In conducting materials, instead, $P(\mathbf r, \mathbf r')$ decays only polynomially, and the 
inner integral diverges. This divergence can be explicitly verified for the simplest 
conductor of all, namely, the noninteracting electron gas, whose density matrix 
is exactly known in analytic form [137, 150]. Therefore the localization tensor, 
when expressed in the form of Eq. (6.34), measures in a perspicuous way the 
"nearsightedness" [151] of the electron distribution. Such measure is qualitatively 
different in insulators and in metals. 
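
The contrast between the two asymptotic behaviors can be made concrete with a rough numerical sketch. The assumptions here are ours, not the text's: we use the one-dimensional spinless electron gas, whose projector is sin(k_F x)/(π x), as a stand-in for the 3d case, and a purely hypothetical exponentially decaying function (not a true projector) as a stand-in for an insulator. The inner integral of Eq. (6.34) is estimated with a midpoint rule up to a cutoff.

```python
import math

def gas_projector(x, k_f=1.0):
    """Spinless 1d free-electron-gas projector, P(x) = sin(k_F x) / (pi x)."""
    if x == 0.0:
        return k_f / math.pi
    return math.sin(k_f * x) / (math.pi * x)

def model_insulator_projector(x, xi=1.0):
    """Hypothetical exponentially decaying kernel, for comparison only."""
    return math.exp(-abs(x) / xi) / (2.0 * xi)

def second_moment(projector, cutoff, n_steps=20000):
    """Midpoint estimate of the integral of x^2 |P(x)|^2 over [-cutoff, cutoff]."""
    h = 2.0 * cutoff / n_steps
    total = 0.0
    for i in range(n_steps):
        x = -cutoff + (i + 0.5) * h
        total += x * x * projector(x) ** 2
    return total * h

metal = {c: second_moment(gas_projector, c) for c in (10.0, 20.0, 40.0)}
insulator = {c: second_moment(model_insulator_projector, c) for c in (10.0, 20.0, 40.0)}
for cutoff in (10.0, 20.0, 40.0):
    print(f"cutoff {cutoff:5.1f}:  metal {metal[cutoff]:8.4f}   insulator {insulator[cutoff]:8.4f}")
```

For the electron gas the integrand does not decay, so the estimate keeps growing linearly with the cutoff; for the exponentially decaying kernel it saturates, which is the behavior expected of a finite localization tensor.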

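The one-particle orbitals (either HF or KS) in a crystalline solid have the 
Bloch form. We therefore may wish to replace the orbitals in the expression for 
$P(\mathbf r, \mathbf r')$, Eq. (6.30), 

$$ \varphi_j(\mathbf r) \to \psi_{j\mathbf k}(\mathbf r) = e^{i\mathbf k\cdot\mathbf r}\, u_{j\mathbf k}(\mathbf r), \tag{6.35} $$

where j is the band index and $\mathbf k$ is the Bloch vector. We stress that PBCs are 
at the very root of Bloch's theorem. If the orbitals are normalized to one over 
the crystal cell of volume $V_{\rm cell}$, the ground-state projector in insulating crystals 
is 

$$ P(\mathbf r, \mathbf r') = \frac{V_{\rm cell}}{(2\pi)^3} \sum_{j=1}^{n} \int_{\rm BZ} d\mathbf k\, \psi_{j\mathbf k}(\mathbf r)\, \psi_{j\mathbf k}^*(\mathbf r') = \frac{V_{\rm cell}}{(2\pi)^3} \sum_{j=1}^{n} \int_{\rm BZ} d\mathbf k\, e^{i\mathbf k\cdot(\mathbf r - \mathbf r')}\, u_{j\mathbf k}(\mathbf r)\, u_{j\mathbf k}^*(\mathbf r'), \tag{6.36} $$

where $n = N_c/2$ is the number of occupied bands, and the integral is taken over 
the Brillouin zone (equivalently, over the reciprocal cell). 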

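Using Eq. (6.36), the localization tensor in a band insulator becomes (for 
double occupancy) proportional to the BZ integral of the Bloch metric-curvature 
tensor, Eq. (3.47), i.e. 

$$ \langle r_\alpha r_\beta\rangle_c = \frac{V_{\rm cell}}{n\,(2\pi)^3} \int_{\rm BZ} d\mathbf k\, \mathcal F_{\alpha\beta}(\mathbf k), $$
$$ \mathcal F_{\alpha\beta}(\mathbf k) = \sum_{j=1}^{n} \langle \partial_\alpha u_{j\mathbf k} | \partial_\beta u_{j\mathbf k} \rangle - \sum_{j,j'=1}^{n} \langle \partial_\alpha u_{j\mathbf k} | u_{j'\mathbf k} \rangle \langle u_{j'\mathbf k} | \partial_\beta u_{j\mathbf k} \rangle. \tag{6.37} $$

The proof is given in Refs. [4, 137], and will not be repeated here. 

Within OBCs the localization tensor is always real: this is perspicuous 
in Eqs. (6.31) and (6.34). Instead the imaginary part of the PBC $\langle r_\alpha r_\beta\rangle_c$, 
Eq. (6.37), is the BZ integral of the Bloch Berry curvature, discussed below, 
Sec. xxxx. 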

6.4.3 Wannier functions 

Expressions similar to Eq. (6.37) also enter the Marzari-Vanderbilt theory of 
maximally localized Wannier functions [103]; the relationship is 

$$ \sum_{\alpha=1}^{d} \langle r_\alpha r_\alpha\rangle_c = \frac{\Omega_{\rm I}}{n}. \tag{6.38} $$

Here $\Omega_{\rm I}$ indicates the gauge-invariant part of the quadratic spread of the 
Wannier functions, as in Ref. [103] and in the subsequent literature. The trace in 
Eq. (6.38) is a lower bound (and not a minimum in dimension d > 1) for the 
spherical second (cumulant) moment, a.k.a. quadratic spread, of the Wannier 
functions, averaged over the sample. 

We stress that Eqs. (6.36) and (6.37) make sense only insofar as the Fermi level 
falls in a gap, in which case $\langle r_\alpha r_\beta\rangle_c$ is always finite. If we vary the Hamiltonian 
continuously, allowing the gap to close, then $\langle r_\alpha r_\beta\rangle_c$ diverges; the quadratic 
spread of the Wannier functions diverges as well [106]. 

6.5 Localization in different kinds of insulators 

6.5.1 Small molecules 

The modern theory of the insulating state clearly addresses extended 
systems, i.e. the N → ∞ limit; indeed it makes little sense to ask whether a 
small molecule is insulating or conducting. Nonetheless the concept of 
localized/delocalized electronic states is of the utmost importance in quantum 
chemistry as well, notably in relationship to aromaticity. 

The tensor $\langle r_\alpha r_\beta\rangle_c$ within OBCs is always real symmetric. If the 
ground-state wavefunction is a Slater determinant, then the trace of the tensor at finite 
N has the meaning of a lower bound for the quadratic spread of the Boys 
localized orbitals, averaged over all the occupied orbitals [106]. 

The small-N version of the main concepts of the present review (in their 
OBCs flavour [106]) has been adopted in quantum chemistry by Ángyán 
[152, 153]. Besides providing HF calculations of $\langle r_\alpha r_\beta\rangle_c$ for a sample of small 
molecules, Ángyán even provides experimental values drawn from compilations 
of the dipole oscillator-strength distributions: basically, from Eq. (6.25). 

6.5.2 Band insulators 

The theory warrants that the localization tensor is finite in any insulator. 
However, quantitative calculations for both model tight-binding Hamiltonians and 
realistic solids within density-functional theory have been used to illustrate the 
theory and to identify trends. For instance one expects much smaller diagonal 
elements $\langle r_\alpha r_\alpha\rangle_c$ for strong (i.e. large-gap) insulators than for weak (small-gap) 
ones. This is also suggested by the SWM inequality, Eq. (6.29). 
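
To get a feeling for the orders of magnitude, the SWM bound can be evaluated for a few gap values. The sketch below takes the inequality in the form $\langle r_\alpha r_\alpha\rangle_c \le \hbar^2/(2 m_e E_g)$, which in atomic units (ħ = m_e = 1) reads 1/(2 E_g) with the gap in hartree. The function name and the sample gaps are hypothetical and chosen for illustration only; they are not material data.

```python
HARTREE_IN_EV = 27.211386  # 1 hartree expressed in eV

def swm_bound_bohr2(gap_ev):
    """Upper bound hbar^2 / (2 m_e E_g) on <r_a r_a>_c, returned in bohr^2.

    In atomic units (hbar = m_e = 1) the bound is 1 / (2 E_g),
    with E_g expressed in hartree.
    """
    if gap_ev <= 0.0:
        raise ValueError("the bound is meaningful only for a finite gap")
    return HARTREE_IN_EV / (2.0 * gap_ev)

for gap_ev in (0.5, 1.0, 5.0, 10.0):   # illustrative gaps, not material data
    print(f"E_g = {gap_ev:5.1f} eV  ->  bound = {swm_bound_bohr2(gap_ev):7.2f} bohr^2")
```

The bound scales as the inverse gap, in line with the expected trend: the larger the gap, the tighter the constraint on the localization tensor.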

Figure 6.1: Diagonal element of the KS localization tensor vs. the inverse 
direct gap (theoretical and experimental), for several elemental and binary 
semiconductors. The points corresponding to Si and Ge with the theoretical 
gaps are out of scale. From Ref. [137]. 

Let us start with a simple tight-binding (a.k.a. Hückel) Hamiltonian in 1d: 

$$ \hat H = \sum_{j\sigma} \left[ (-1)^j \Delta\, c^\dagger_{j\sigma} c_{j\sigma} - t\, ( c^\dagger_{j\sigma} c_{j+1\,\sigma} + \mathrm{H.c.} ) \right], \tag{6.39} $$

where t > 0 is the first-neighbor hopping ($\beta = -t$ in most chemistry literature) 
and H.c. stands for Hermitian conjugate. This toy model schematizes a binary 
ionic crystal; the band structure is 

$$ \epsilon(q) = \pm \sqrt{\Delta^2 + 4 t^2 \cos^2 (qa/2)}, \tag{6.40} $$

where a is the lattice constant and q is the Bloch vector. The gap is equal to 
$2\Delta$; at half filling the system is always insulating except for $\Delta = 0$. The squared 
localization length (within OBCs) is the tight-binding version of Eq. (6.31), i.e. 

$$ \langle x^2\rangle_c = \frac{a^2}{4N} \sum_{j,j'=1}^{N} P_{jj'}^2\, (j - j')^2. \tag{6.41} $$

This is a monotonic function of $t/\Delta$; it is easily verified that it vanishes in 
the extreme ionic case (t = 0). In the metallic case ($\Delta = 0$) the ground-state 
projector has a simple analytical form: 

$$ P_{jj} = \frac{1}{2}; \qquad P_{jj'} = 0 \ \ \mathrm{for\ even}\ |j' - j| = 2s \neq 0; \qquad P_{jj'} = \frac{(-1)^s}{\pi (2s + 1)} \ \ \mathrm{for\ odd}\ |j' - j| = 2s + 1, \tag{6.42} $$

which clearly implies divergence of Eq. (6.41). At any finite N within OBCs 
Eq. (6.41) leads to a finite $\langle x^2\rangle_c$ value; however Eq. (6.42) suggests that in the 
metallic case $\langle x^2\rangle_c$ diverges linearly with N. This has been verified by actual 
simulations, even when $\Delta \neq 0$ but the Fermi level is not in the gap [144]. 
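
Both regimes of this toy model can be explored with a short script. What follows is a sketch under assumptions of ours, not the simulations of Ref. [144]: for the insulating case the projector is built from the Bloch eigenvectors on a ring and the site separations in the tight-binding sum are taken with the minimum-image convention (an approximation, reasonable when the projector decays well within the ring); for the metallic case the infinite-chain projector, with modulus 1/(π|j - j'|) at odd separations, is simply truncated to N sites. The function names are hypothetical.

```python
import cmath
import math

def ring_projector(n_cells, t, delta):
    """Per-spin ground-state projector of the half-filled binary chain on a ring.

    Site 2R has energy +delta, site 2R+1 has energy -delta; requires delta > 0.
    """
    n_sites = 2 * n_cells
    proj = [[0j] * n_sites for _ in range(n_sites)]
    for m in range(n_cells):
        k = 2.0 * math.pi * m / n_cells               # Bloch phase per cell
        h = -t * (1.0 + cmath.exp(-1j * k))           # off-diagonal Bloch element
        eps = math.sqrt(delta ** 2 + abs(h) ** 2)     # band energies are +-eps
        v = (-h, delta + eps)                         # lower-band eigenvector
        norm2 = abs(v[0]) ** 2 + abs(v[1]) ** 2
        for r1 in range(n_cells):
            for r2 in range(n_cells):
                phase = cmath.exp(1j * k * (r1 - r2)) / (n_cells * norm2)
                for s1 in (0, 1):
                    for s2 in (0, 1):
                        proj[2 * r1 + s1][2 * r2 + s2] += phase * v[s1] * v[s2].conjugate()
    return proj

def x2_insulator(n_cells, t, delta, a=1.0):
    """Tight-binding <x^2>_c on the ring, with minimum-image separations."""
    n_sites = 2 * n_cells
    proj = ring_projector(n_cells, t, delta)
    total = 0.0
    for j in range(n_sites):
        for jp in range(n_sites):
            d = min(abs(j - jp), n_sites - abs(j - jp))
            total += abs(proj[j][jp]) ** 2 * d * d
    return a * a / (4.0 * n_sites) * total

def x2_metal(n_sites, a=1.0):
    """Same sum with the infinite-chain metallic projector cut to n_sites."""
    total = 0.0
    for j in range(n_sites):
        for jp in range(n_sites):
            m = abs(j - jp)
            if m % 2 == 1:                            # only odd separations contribute
                total += (1.0 / (math.pi * m)) ** 2 * m * m
    return a * a / (4.0 * n_sites) * total

for t in (0.0, 0.5, 1.0, 2.0):
    print(f"insulator, t/Delta = {t:3.1f}:  <x^2>_c = {x2_insulator(24, t, 1.0):.5f} a^2")
for n in (20, 40, 80):
    print(f"metal, N = {n:3d}:  <x^2>_c = {x2_metal(n):.5f} a^2")
```

In the gapped case the result vanishes at t = 0 and grows with t/Δ, while in the truncated metallic case every odd separation contributes the same amount, so that the sum grows linearly with N.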

Other simulations [140] have addressed dimerized chains, i.e. $\Delta = 0$ but 
alternating hoppings in Eq. (6.39). While nothing relevant occurs within PBCs, 
partly filled end states within OBCs at some fillings are at the root of some 
noticeable features. 

The first ab-initio study (in 2001) addressed several elemental and binary 
cubic semiconductors at the KS level [137]. The tensor is real and isotropic. 
The computed $\langle x^2\rangle_c$ (Fig. 6.1) is smaller than 3 bohr$^2$ in all the materials 
studied: the ground many-body wavefunction is therefore very localized in this 
class of materials. The SWM inequality was also checked, and found to be well 
verified using both the theoretical KS gap and the experimental one (the latter 
is typically larger). 

Other studies have addressed the ferroelectric perovskites in their different 
(cubic and noncubic) structures [138], and some model Hamiltonians in 1d and 
2d [139]. 
Figure 6.2: Squared localization length for the Hamiltonian in Eq. (6.43) at 
half filling for $t/\Delta = 1.75$. The system undergoes a quantum phase transition 
from band-like insulator to Mott-like insulator at U/t = 2.27. From Ref. [129]. 



6.5.3 Correlated (Mott) insulators 

Starting from the noninteracting Hamiltonian of Eq. (6.39) and augmenting it 
with an on-site repulsive term we get the two-band Hubbard model 

$$ \hat H = \sum_{j\sigma} \left[ (-1)^j \Delta\, c^\dagger_{j\sigma} c_{j\sigma} - t\, ( c^\dagger_{j\sigma} c_{j+1\,\sigma} + \mathrm{H.c.} ) \right] + U \sum_j n_{j\uparrow} n_{j\downarrow}. \tag{6.43} $$

The explicitly correlated ground-state wavefunction has been found by exact 
diagonalization [129], and the corresponding $\langle x^2\rangle_c$ has been computed as a 
function of U for fixed $t/\Delta = 1.75$. The results are shown in Fig. 6.2 in 
dimensionless units; it turns out that there is only one singular point, U = 2.27t, 
where $\langle x^2\rangle_c$ diverges. Indeed, it has been verified that at such value the ground 
state becomes degenerate with the first excited singlet state, i.e. the system is 
metallic. The singular point is the fingerprint of a quantum phase transition: 
on the left we have a band-like insulator, and on the right a Mott-like insulator. 
The two insulating states are qualitatively different; by adopting the modern 
jargon, nowadays we could say that they are topologically distinct. The static 
ionic charges (on anion and cation) are continuous across the transition, while 
the dynamical (Born) effective charge on a given site changes sign [154]. Other 
studies of the localization tensor within the same Hubbard model can be found 
in Ref. [155]. 

The transition from a band metal to a Mott insulator has been studied in 
a model linear chain of Li atoms by Vetere et al. [142]. At a mean-field level 
the infinite chain is obviously metallic at any lattice constant a, since there 
is one valence electron per cell. However the mean-field description becomes 
inadequate at large a, where the electrons localize and the system becomes a 
Mott insulator. If electron correlation is properly accounted for at any a, the 
system undergoes a sharp metal-insulator transition at a critical a. 

The calculations addressed linear Li$_N$ systems ($N$ up to 8) within OBCs,
where the finite size prevents a sharp transition; the tradeoff is that full
configuration interaction was affordable with 6 atomic orbitals per site
(yielding more than $10^9$ symmetry-adapted Slater determinants). The
wavefunction of Vetere et al. is therefore exempt from any bias insofar as the
treatment of correlation is concerned, although its quality is determined by the
basis set. A study of the longitudinal component $\langle x^2 \rangle_c$ of the
localization tensor indicates rather clearly the occurrence of the
metal-insulator transition at $a \simeq 7$ bohr; other indicators give
concordant results [142]. For comparison, the nearest-neighbour distance
in 3d metallic lithium is 5.73 bohr.

Figure 6.3: Density of states (arbitrary units) for a model binary alloy in 1d.
The crystalline (band) case corresponds to the Hamiltonian of Eq. (6.39) with
$\Delta = 0.25$ and $t = 1$. The disordered (Anderson) case corresponds to a
random choice of the anion/cation distribution.



More recently Stella et al. [143] have performed variational quantum Monte 
Carlo studies of hydrogen chains, up to 66 atoms, within PBCs. The crossover 
between the weakly correlated (band) metallic regime — at short distances — 
and the strongly correlated (Mott) insulating regime — at large distances — has 
been thoroughly investigated by means of $\langle x^2 \rangle_c$. The Mott
transition occurs at $a \simeq 3.5$ bohr.

6.5.4 Disordered (Anderson) insulators 

We start from the same Hamiltonian as in Eq. (6.39), and we replace the ordered
string $(-1)^j$ by a random string of $\pm 1$, chosen with equal (and
uncorrelated) probability. This system models a random binary alloy at 50%
concentration. It is well known both from analytical arguments and actual
simulations that its spectrum is gapless [125, 156]. The densities of states
for both the ordered and disordered systems are shown in Fig. 6.3, and confirm
the expected features. The band structure of Eq. (6.40) obviously yields a
gapped density of states; at the band edges it shows van Hove singularities,
which in 1d have the character of $1/\sqrt{\epsilon}$ divergences. As discussed
above, the ordered system is insulating at half filling and conducting
otherwise. The disordered system, instead, is gapless and nonetheless
insulating at any filling. In fact, this model Hamiltonian describes
a paradigmatic Anderson insulator in 1d.
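
The contrast between the gapped and the gapless spectrum can be checked
numerically with a few lines of code. What follows is only a minimal sketch,
not the calculation of the cited references: it assumes spinless electrons, a
finite chain within OBCs, on-site energies $\pm\Delta$ and nearest-neighbour
hopping $-t$ as in the Hamiltonian described above; the function names, the
chain length, the random seed and the histogram binning are arbitrary choices
made for illustration.

```python
import numpy as np

def chain_hamiltonian(onsite, t=1.0):
    """One-particle tight-binding matrix of an open chain (OBCs)."""
    n = len(onsite)
    h = np.diag(np.asarray(onsite, dtype=float))
    hop = -t * np.ones(n - 1)          # nearest-neighbour hopping -t
    return h + np.diag(hop, 1) + np.diag(hop, -1)

def half_filling_gap(eps):
    """Gap between the highest occupied and lowest empty level (spinless)."""
    n_occ = len(eps) // 2
    return eps[n_occ] - eps[n_occ - 1]

n_sites, delta, t = 400, 0.25, 1.0     # illustrative size; Delta, t as in Fig. 6.3
rng = np.random.default_rng(0)         # arbitrary seed (assumption)

signs_ordered = (-1.0) ** np.arange(n_sites)          # ordered string (-1)^j
signs_random = rng.choice([-1.0, 1.0], size=n_sites)  # uncorrelated +/-1

# eigvalsh returns the eigenvalues in ascending order
eps_ordered = np.linalg.eigvalsh(chain_hamiltonian(delta * signs_ordered, t))
eps_random = np.linalg.eigvalsh(chain_hamiltonian(delta * signs_random, t))

gap_ordered = half_filling_gap(eps_ordered)
gap_random = half_filling_gap(eps_random)
print(f"ordered chain:    gap = {gap_ordered:.4f}  (2*Delta = {2 * delta})")
print(f"disordered chain: gap = {gap_random:.4f}")

# Crude densities of states as histograms of the one-particle levels
dos_ordered, edges = np.histogram(eps_ordered, bins=50, range=(-2.5, 2.5))
dos_random, _ = np.histogram(eps_random, bins=50, range=(-2.5, 2.5))
```

For the ordered chain the gap at half filling cannot be smaller than
$2\Delta$, while for the disordered chain the level spacing at the Fermi level
is expected to shrink as the chain gets longer. A single finite realization
only suggests the gapless limit; it does not prove it.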

The conventional theory of transport focusses on the nature of the one- 
particle orbitals at the Fermi level; in Anderson insulators these are localized, 
thus forbidding steady state currents [123]. More than fifty years of literature
have been devoted to investigating Anderson insulators under the most diverse
aspects [125, 156, 157, 158].

At variance with such wisdom, a recent work has addressed this paradigmatic
Anderson insulator from the nonconventional viewpoint of the modern
theory of the insulating state [144]. In the spirit of Kohn's theory the
individual Hamiltonian eigenstates become apparently irrelevant, while the focus
is on the many-electron ground state as a whole. The squared localization length
$\langle x^2 \rangle_c$ has been computed within OBCs from Eq. (6.41), and found
to be finite, as expected. Nonetheless its value is about 20 times larger than
the one for the band insulator, at the same value of the parameters (i.e.
$\Delta = 0.25$, $t = 1$). This reflects the fact that the scattering mechanisms
are profoundly different: incoherent (Anderson) versus coherent (band). In the
latter case, the Hamiltonian eigenstates are individually conducting but
"locked" by the Pauli principle if the Fermi level lies in the gap.
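
A calculation of this kind can be sketched as follows. The sketch is not the
code of Ref. [144]. It assumes that, for a Slater determinant of $N$ spinless
electrons within OBCs, the squared localization length takes the
density-matrix form $\langle x^2 \rangle_c = \frac{1}{2N} \sum_{jj'} (x_j -
x_{j'})^2 |P_{jj'}|^2$, where $P$ is the ground-state projector; whether this
coincides term by term with Eq. (6.41) is an assumption. Site positions are
taken as $x_j = j$ (unit lattice constant), and the chain length, the seed and
all function names are illustrative choices.

```python
import numpy as np

def chain_hamiltonian(onsite, t=1.0):
    """One-particle tight-binding matrix of an open chain (OBCs)."""
    n = len(onsite)
    h = np.diag(np.asarray(onsite, dtype=float))
    hop = -t * np.ones(n - 1)
    return h + np.diag(hop, 1) + np.diag(hop, -1)

def squared_localization_length(h, n_occ):
    """<x^2>_c per electron from the ground-state projector P (assumed form).

    Uses (1/2N) sum_{jj'} (x_j - x_j')^2 |P_jj'|^2, which is manifestly
    non-negative; positions are the site indices.
    """
    n = h.shape[0]
    _, vecs = np.linalg.eigh(h)          # columns sorted by ascending energy
    occ = vecs[:, :n_occ]                # occupied one-particle orbitals
    p = occ @ occ.T                      # real symmetric projector
    x = np.arange(n, dtype=float)
    dx2 = (x[:, None] - x[None, :]) ** 2
    return 0.5 * np.sum(dx2 * p ** 2) / n_occ

n_sites, delta, t = 400, 0.25, 1.0       # illustrative size; Delta, t as in the text
n_occ = n_sites // 2                     # half filling, spinless electrons
rng = np.random.default_rng(0)           # arbitrary seed (assumption)

onsite_ordered = delta * (-1.0) ** np.arange(n_sites)
onsite_random = delta * rng.choice([-1.0, 1.0], size=n_sites)

lam2_ordered = squared_localization_length(
    chain_hamiltonian(onsite_ordered, t), n_occ)
lam2_random = squared_localization_length(
    chain_hamiltonian(onsite_random, t), n_occ)
print(f"band insulator:     <x^2>_c = {lam2_ordered:.3f}")
print(f"Anderson insulator: <x^2>_c = {lam2_random:.3f}")
```

A single disorder realization at one chain length gives only an indication. A
quantitative comparison with the ratio quoted above would require averaging
over many realizations and checking convergence with the chain length, which
this sketch does not attempt.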